WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6: C12N 15/12, C07K 14/47, C12N 15/62,
G01N 33/68, C12Q 1/68, C07K 16/18

A2

(11) International Publication Number: WO 99/36537

(43) International Publication Date: 22 July 1999 (22.07.99)



(21) International Application Number: PCT/US99/00182 

(22) International Filing Date: 14 January 1999 (14.01.99) 



(30) Priority Data:
60/071,374    15 January 1998 (15.01.98)    US
60/093,491    20 July 1998 (20.07.98)       US
60/110,941    4 December 1998 (04.12.98)    US
09/232,197    14 January 1999 (14.01.99)    US
09/232,200    14 January 1999 (14.01.99)    US
09/232,201    14 January 1999 (14.01.99)    US
09/232,195    14 January 1999 (14.01.99)    US



(71) Applicants (for all designated States except US): WHITEHEAD
INSTITUTE FOR BIOMEDICAL RESEARCH
[US/US]; Nine Cambridge Center, Cambridge, MA 02142
(US). MILLENNIUM PHARMACEUTICALS, INC.
[US/US]; 238 Main Street, Cambridge, MA 02142 (US).

(72) Inventors; and

(75) Inventors/Applicants (for US only): STAHL, Andreas
[DE/US]; 12 Farrington Avenue #2, Allston, MA 02134
(US). HIRSCH, David, J. [US/US]; 120 Amory Street
#5, Brookline, MA 02146 (US). LODISH, Harvey, F.
[US/US]; 195 Fischer Avenue, Brookline, MA 02146 (US).
GIMENO, Ruth, E. [DE/US]; 65 Beverly Road, Wellesley,
MA 02481 (US). TARTAGLIA, Louis, A. [US/US]; 32
Manor House Road, Newton, MA 02159 (US).

(74) Agents: GRANAHAN, Patricia et al.; Hamilton, Brook, Smith 
& Reynolds, P.C., Two Militia Drive, Lexington, MA 02421 
(US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR,
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GD,
GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP,
KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK,
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG,
SI, SK, SL, TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU,
ZW, ARIPO patent (GH, GM, KE, LS, MW, SD, SZ, UG,
ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ,
TM), European patent (AT, BE, CH, CY, DE, DK, ES, FI,
FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent
(BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE,
SN, TD, TG).



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: FATTY ACID TRANSPORT PROTEINS 



(57) Abstract 

A family of fatty acid transport proteins (FATPs) mediate transport of long chain fatty acids (LCFAs) across cell membranes into 
cells. These proteins exhibit different expression patterns among the organs of mammals. Nucleic acids encoding FATPs of this family, 
vectors comprising these nucleic acids, as well as the production of FATP proteins in host cells are described. Also described are methods 
to test FATPs for fatty acid transport function, and methods to identify inhibitors or enhancers of transport function. The altering of LCFA 
uptake by administering to the mammal an inhibitor or enhancer of FATP transport function of a FATP in the small intestine can decrease 
or increase calories available as fats, and can decrease or increase circulating fatty acids. The organ specificity of FATP distribution can 
be exploited in methods to direct drugs, diagnostic indicators and so forth to an organ such as the heart. 






FATTY ACID TRANSPORT PROTEINS 

RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Application
Number 60/071,374 entitled "Identification of a Family of Fatty Acid Transporters
Conserved From Mycobacterium to Man," by Andreas Stahl, David Hirsch and
Harvey F. Lodish, filed on January 15, 1998; U.S. Provisional Application Number
60/093,491 entitled "Fatty Acid Transport Proteins," by Andreas Stahl, David J.
Hirsch, Harvey F. Lodish, Ruth E. Gimeno and Louis A. Tartaglia, filed on July 20,
1998; and U.S. Provisional Application Number 60/110,941 entitled "Fatty Acid
Transport Proteins," by Andreas Stahl, David J. Hirsch, Harvey F. Lodish, Ruth E.
Gimeno and Louis A. Tartaglia, filed on December 4, 1998. This application also
claims priority to Attorneys Docket Nos. WHI97-21p3MA, WHI97-21p3MB,
WHI97-21p3MC, WHI97-21p3MD, each of which is entitled "Fatty Acid Transport
Proteins," by Andreas Stahl, David J. Hirsch, Harvey F. Lodish, Ruth E. Gimeno
and Louis A. Tartaglia, filed on January 14, 1999. The teachings of each of these
referenced applications are incorporated herein by reference in their entirety.

GOVERNMENT SUPPORT 

The invention was supported, in whole or in part, by National Institutes of
Health Grant DK 47618 and National Institutes of Health Grant 5 T32 CA 09541.
The United States Government has certain rights in the invention.

BACKGROUND OF THE INVENTION

Long chain fatty acids (LCFAs) are an important source of energy for most
organisms. They also function as blood hormones, regulating key metabolic functions
such as hepatic glucose production. Although LCFAs can diffuse through the
hydrophobic core of the plasma membrane into cells, this nonspecific transport cannot
account for the high affinity and specific transport of LCFAs exhibited by cells such as
cardiac muscle, hepatocytes, enterocytes, and adipocytes. The molecular mechanisms
of LCFA transport remain largely unknown. Identifying these mechanisms can lead to
pharmaceuticals that modulate fatty acid uptake by the intestine and by other organs,
thereby alleviating certain medical conditions (e.g., obesity).

SUMMARY OF THE INVENTION 

Described herein is a diverse family of fatty acid transport proteins (FATPs)
which are evolutionarily conserved; these FATPs are plasma membrane proteins which
mediate transport of LCFAs across the membranes and into cells. Members of the
FATP family described herein are present in a wide variety of organisms, from
mycobacteria to humans, and exhibit very different expression patterns in tissues among
the organisms. FATP family members are expressed in prokaryotic and eukaryotic
organisms and comprise characteristic amino acid domains or sequences which are
highly conserved across family members. In addition, the function of the FATP gene
family is conserved throughout evolution, as shown by the fact that the Caenorhabditis
(C.) elegans and mycobacterial FATPs described herein facilitate LCFA uptake when
they are overexpressed in COS cells or Escherichia (E.) coli, respectively. FATPs are
expressed in a wide variety of tissues, including all tissues which are important to fatty
acid metabolism (uptake and processing).

In specific embodiments, FATPs of the present invention are from such diverse
organisms as humans (Homo (H.) sapiens), mice (Mus (M.) musculus), F. rubripes, C.
elegans, Drosophila (D.) melanogaster, Saccharomyces (S.) cerevisiae, Aspergillus
nidulans, Cochliobolus heterostrophus, Magnaporthe grisea and Mycobacterium (M.),
such as M. tuberculosis. As described herein, four novel mouse FATPs, referred to as
mmFATP2, mmFATP3, mmFATP4 and mmFATP5, and six human FATPs, referred to
as hsFATP1, hsFATP2, hsFATP3, hsFATP4, hsFATP5 and hsFATP6, have been
identified. All four novel murine FATPs (mmFATP2-5) and a previously identified
murine FATP (renamed herein FATP1) have orthologs in humans (hsFATP1-5); the
sixth human FATP (hsFATP6) does not as yet have a mouse ortholog. The expression
patterns of these FATPs vary, as described in detail below.

The present invention relates to FATP family members from prokaryotes and
eukaryotes, nucleic acids (DNA, RNA) encoding FATPs, and nucleic acids which are
useful as probes or primers (e.g., for use in hybridization methods, amplification
methods), for example, in methods of detecting FATP-encoding genes, producing
FATPs, and purifying or isolating FATP-encoding DNA or RNA. Also the subject of
this invention are antibodies (polyclonal or monoclonal) which bind an FATP or
FATPs; methods of identifying additional FATP family members (for example,
orthologs of those FATPs described herein by amino acid sequence) and variant alleles
of known FATP genes; methods of identifying compounds which bind to an FATP, or
modulate or alter (enhance or inhibit) FATP function; compounds which modulate or
alter FATP function; methods of modulating or altering (enhancing or inhibiting) FATP
function and, thus, LCFA uptake into tissues of a mammal (e.g., human) by
administering a compound or molecule (a drug or agent) which increases or reduces
FATP activity; and methods of targeting compounds to tissues by administering a
complex of the compound to be targeted to tissues and a component which is bound by
an FATP present on cells of the tissues to which the compound is to be targeted. For
example, a complex of a drug to be delivered to the liver and a component which is
bound by an FATP present on liver cells (e.g., FATP5) can be administered.

In one embodiment, the present invention relates to modulating or altering
(enhancing or inhibiting/reducing) LCFA uptake in the small intestine and, thus,
increasing or reducing the number of calories in the form of fats available to an
individual. In another embodiment, the present invention relates to inhibiting or
reducing LCFA uptake in the small intestine in order to reduce circulating fatty acid
levels; that is, LCFA uptake in the small intestine is reduced, and, therefore, circulating
(blood) levels are not as high as they otherwise would be. FATP4 has been shown to be
expressed in epithelial cells of the small intestine and particularly in the brush border
layer of the small intestine. FATP2 has also been shown to be expressed at low levels
in epithelial cells of the small intestine, particularly in the duodenum. In contrast,
FATP1, FATP3, FATP5 and FATP6 were not detected in any of the intestinal tissues.
Thus, also described herein are FATPs which are present in the epithelial cell layer of
the small intestine where they mediate LCFA uptake. These FATPs, particularly
FATP4 and also FATP2, are targets for methods and drugs which block their function
or activity and are useful in treating obesity, diabetes and heart disease. The ability of
these FATPs to mediate fat uptake can be modulated or altered (enhanced or inhibited),
thus modulating fat uptake in the small intestine. This can be done, for example, by
administering to an individual, such as a human or other animal, a drug which blocks
interaction of LCFAs with FATP4 and/or FATP2 in the small intestine, thus inhibiting
LCFA passage into the cells of the small intestine. As a result, fat absorption is reduced
and, although the individual has consumed a certain quantity of fat, the LCFAs are not
absorbed to the same extent they would have been in the absence of the compound
administered.

Thus, one embodiment of this invention is a method of reducing LCFA uptake
(absorption) in the small intestine and, as a result, reducing caloric uptake in the form of
fat. A further embodiment is a compound (drug) useful in inhibiting or reducing fat
absorption in the small intestine. In another embodiment, the invention is a method of
reducing circulating fatty acid levels by administering to an individual a compound
which blocks interactions of LCFAs with FATP4 and/or FATP2 in the small intestine,
thus inhibiting LCFA passage into cells of the small intestine. As a result, fatty acids
pass into the circulatory system at a diminished level and/or rate, and circulating fatty
acid levels are lower than they would be in the absence of the compound administered.
This method is particularly useful for therapy in individuals who are at risk for or have
hyperlipidemia. That is, it can be used to prevent the occurrence of elevated levels of
lipids in the blood or to treat an individual in whom blood lipid levels are elevated.
Also the subject of this invention is a method of identifying compounds which alter
FATP function (and thus, in the case of FATP2 and/or FATP4, alter LCFA uptake in
the small intestine).

In another embodiment, the present invention relates to a method of modulating
or altering (enhancing or inhibiting) the function of FATP6, which is expressed at high
levels in the heart. A method of inhibiting FATP6 function is useful, for example, in
individuals with heart disease, such as ischemia, since reducing LCFA uptake into heart
muscle in an individual who has ischemic heart disease, which may be manifested by,
for example, angina or heart attack, can reduce symptoms or reduce the extent of
damage caused by the ischemia. In this embodiment, a drug which inhibits FATP6
function is administered to an individual who has had or is having a heart attack, to
reduce LCFA uptake by the individual's heart and, as a result, reduce the damage caused
by ischemia. In a further embodiment, this invention is a method of targeting a
compound, such as a therapeutic drug or an imaging reagent, to heart tissue by
administering to an individual (e.g., a human) a complex of the compound and a
component (e.g., a LCFA or LCFA-like compound) which is bound by an FATP (e.g.,
FATP6) present in cells of heart tissue.

In a further embodiment, LCFA uptake by the liver is modulated or altered
(enhanced or reduced) in an individual. For example, a drug which inhibits the
function of an FATP present in liver (e.g., FATP5) is administered to an individual who
is diabetic, in order to reduce LCFA uptake by liver cells and, thus, reduce insulin
resistance.

The present invention, thus, provides methods which are useful to alter,
particularly reduce, LCFA uptake in individuals and, as a result, to alter (particularly
reduce) availability of the LCFAs for further metabolism. In a specific embodiment,
the present invention provides methods useful to reduce LCFA uptake and, thus, fatty
acid metabolism in individuals, with the result that caloric availability from fats is
reduced, and circulating fatty acid levels are lower than they otherwise would be. These
methods are useful, for example, as a means of weight control in individuals (e.g.,
humans) and as a means of preventing elevated serum lipid levels or reducing serum
lipid levels in humans. FATPs expressed in the small intestine, such as FATP4, are
useful targets to be blocked in treating obesity (e.g., chronic obesity) or to be enhanced
in treating conditions in which enhanced LCFA uptake is desired (e.g., malabsorption
syndrome or other wasting conditions).

The identification of this evolutionarily conserved fatty acid transporter family
will allow a better understanding of the mechanisms whereby LCFAs traverse the lipid
bilayer as well as yield insight into the control of energy homeostasis and its
dysregulation in diseases such as diabetes and obesity.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the amino acid sequence alignment of FATPs: mmFATP1 (SEQ
ID NO:92), mmFATP2 (SEQ ID NO:93), mmFATP3 (SEQ ID NO:94), mmFATP4
(SEQ ID NO:95), mmFATP5 (SEQ ID NO:96), ceFATPa (SEQ ID NO:97), scFATP
(SEQ ID NO:98) and mtFATP (SEQ ID NO:99). The underlining (amino acid residues
204-212 of mtFATP) indicates an AMP binding motif which is found in many classes
of proteins; the underlining at amino acid residues 204-507 of the mtFATP sequence
indicates the FATP 360 amino acid signature sequence.

Figures 2A-2D show results of LCFA uptake assays. Figures 2A-2D: COS
cells were cotransfected using the DEAE-dextran method with the mammalian
expression vector pCDNA-CD2 either alone (control; Figure 2A) or in combination
with one of the FATP-containing expression vectors (pCDNA-mmFATP1, Figure 2B;
pCDNA-mmFATP2, Figure 2C; or pCMV-SPORT2-mmFATP5, Figure 2D) as
described in Materials and Methods for Example 2. COS cells were gated on forward
scatter (FSC) and side scatter (SS), and the results shown represent >10,000 cells. Cells
exhibiting >300 CD2 fluorescence units (vertical line), representing 15% of all cells,
were deemed CD2 positive.

Figure 3 is a graph of fluorescence of cells expressing a FATP gene. As in
Figures 2A-2D, COS cells were cotransfected with pCDNA-CD2 either alone (control)
or in combination with one of the FATP-containing expression vectors
(pCDNA-mmFATP1, pCDNA-mmFATP2, pCMV-SPORT2-mmFATP5, or pCDNA-ceFATPb).
The mean BODIPY-FA fluorescence of the CD2-positive cells is plotted; results shown
represent the average of three experiments, each consisting of greater than 50,000 COS
cells. Note that a logarithmic scale is used on the ordinate.

Figure 4 is a graph of the uptake of palmitate with time. The full-length coding
region of mtFATP (squares) or a control protein (TFE3; circles) was subcloned into the
inducible, prokaryotic expression vector pET (Novagen). Expression from the resulting
plasmid was induced (solid symbols) in transformed E. coli cells with 1 mM
isopropyl-β-D-thiogalactoside (IPTG) for 1 hour, or cells were left uninduced (open symbols).
Data points were done in triplicate and counts were normalized to the number of
bacteria as determined by OD600.

Figure 5 is a phylogenetic tree. Complete and partial sequences for FATP genes
from human, rat, mouse, puffer fish, D. melanogaster, C. elegans, S. cerevisiae, and
M. tuberculosis were aligned using ClustalX, and the tree was drawn from these data
using TreeViewPPC. The bar indicates the number of
substitutions per residue, i.e., 0.1 corresponds to a distance of 10 substitutions per 100
residues.



Figure 6 shows a comparison of the FATP signature sequences of mmFATP1
(SEQ ID NO:1), mmFATP5 (SEQ ID NO:2), ceFATPa (SEQ ID NO:3), scFATP (SEQ
ID NO:4) and mtFATP (SEQ ID NO:5).

Figure 7 shows the sequence identity among the FATP family members and
VLACs, based on the 360 amino acid signature sequence of FATP from Figure 1.

Figures 8A and 8B are the mmFATP3 DNA sequence (SEQ ID NO:6).

Figure 9 is the mmFATP3 protein sequence (SEQ ID NO:7).

Figures 10A and 10B are the mmFATP4 DNA sequence (SEQ ID NO:8).

Figure 11 is the mmFATP4 protein sequence (SEQ ID NO:9).

Figures 12A and 12B are the mmFATP5 DNA sequence (SEQ ID NO:10).

Figure 13 is the mmFATP5 protein sequence (SEQ ID NO:11).

Figures 14A and 14B are the hsFATP2 DNA sequence (SEQ ID NO:12).

Figure 15 is the hsFATP2 protein sequence (SEQ ID NO:13).

Figures 16A and 16B are the hsFATP3 DNA sequence (SEQ ID NO:14).

Figure 17 is the hsFATP3 protein sequence (SEQ ID NO:15).

Figures 18A and 18B are the hsFATP4 DNA sequence (SEQ ID NO:16).

Figure 19 is the hsFATP4 protein sequence (SEQ ID NO:17).

Figures 20A and 20B are the hsFATP5 DNA sequence (SEQ ID NO:18).

Figure 21 is the hsFATP5 protein sequence (SEQ ID NO:19).

Figures 22A and 22B are the hsFATP6 DNA sequence (SEQ ID NO:20).

Figure 23 is the hsFATP6 protein sequence (SEQ ID NO:21).

Figures 24A and 24B are the mtFATP DNA sequence (SEQ ID NO:22).

Figure 25 is the mtFATP protein sequence (SEQ ID NO:23).

Figure 26 shows the DNA sequence (SEQ ID NO:24) and predicted amino acid
sequence (SEQ ID NO:25) of human FATP1.

Figure 27 shows the DNA sequence (SEQ ID NO:26) and predicted amino acid
sequence (SEQ ID NO:27) of human FATP4.

Figure 28A is a hydrophobicity plot for hsFATP1, showing that it has multiple
membrane-spanning domains.

Figure 28B is the amino acid composition of hsFATP1.

Figure 28C is a hydrophilicity plot for hsFATP1, made using the Kyte-Doolittle
method, averaging hydrophilicity values for 18 amino acid residues at a time.

Figure 29A is a hydrophobicity plot for hsFATP4, showing that it has multiple
membrane-spanning domains.

Figure 29B is a listing of the amino acid composition of hsFATP4.

Figure 29C is a hydrophilicity plot for hsFATP4, made using the Kyte-Doolittle
method, averaging hydrophilicity values for 18 amino acid residues at a time.

Figures 30A and 30B show a comparison of the nucleotide sequence of human
FATP1 (SEQ ID NO:28) and the nucleotide sequence of mouse FATP1 (SEQ ID
NO:29).

Figures 31A and 31B show a comparison of the nucleotide sequence of human
FATP4 (SEQ ID NO:30) and the nucleotide sequence of mouse FATP4 (SEQ ID
NO:31).

Figure 32 shows a comparison of the amino acid sequence of human FATP1
(SEQ ID NO:32) and the amino acid sequence of mouse FATP1 (SEQ ID NO:33).
Shaded amino acid residues match the consensus sequence exactly.

Figure 33 shows a comparison at the amino acid level of human FATP4 (SEQ
ID NO:34) and mouse FATP4 (SEQ ID NO:35). Shaded amino acid residues match the
consensus sequence exactly.

Figure 34 shows the nucleotide sequence (SEQ ID NO:36) and predicted amino
acid sequence (SEQ ID NO:37) of hsFATP6.

Figure 35A is a hydrophobicity plot for hsFATP6, showing that it has multiple
membrane-spanning domains.

Figure 35B is a listing of the amino acid composition of hsFATP6.

Figure 35C is a hydrophilicity plot for hsFATP6, made using the Kyte-Doolittle
method, averaging hydrophilicity values for 18 amino acid residues at a time.

Figure 36 shows an alignment of the amino acid sequences of hsFATP1 (SEQ
ID NO:38), hsFATP4 (SEQ ID NO:39) and hsFATP6 (SEQ ID NO:40). Shaded amino
acid residues match the consensus sequence exactly.

Figure 37 shows results of assessment of fatty acid uptake by human FATP1 and
human FATP4. The percent of CD2-positive cells exhibiting a BODIPY-fluorescence
of more than 300 arbitrary units is plotted for the three different conditions tested.

Figure 38 is a graph showing uptake of tritiated oleate, with time, by 293 cells
transfected with either (diamonds) a plasmid for expression of human FATP4 or
(squares) a control plasmid.

Figure 39 is an illustration of the amino acid sequences of human FATP4 (SEQ
ID NO:41) and mouse FATP4 (SEQ ID NO:42) compared to human FATP1 (SEQ ID
NO:43). Shown by underlining are the FATP consensus sequence (236-556 of
hsFATP1) and the AMP-binding motif (246-254 of hsFATP1). The human FATPs
were cloned by screening libraries with sequences from ESTs (expressed sequence
tags). Mouse FATP4 was cloned by PCR using degenerate primers.

Figure 40 is a graph showing the uptake, with time, of tritiated oleate by mouse
enterocytes in the presence of no oligonucleotide (squares), sense oligonucleotide
(circles) or antisense oligonucleotide (diamonds).

Figure 41 is a bar graph showing uptake of tritiated oleate by mouse enterocytes
in the presence of various concentrations of antisense (solid bars), mismatch (stippled
bars) or sense (lined bars) oligonucleotides.

Figure 42 is a bar graph showing uptake of tritiated oleate and uptake of
35S-labeled methionine by mouse enterocytes to which were added no oligonucleotide, the
antisense oligonucleotide, or the mismatch oligonucleotide.

Figure 43A is the nucleotide sequence of the gene encoding mouse FATP4 (SEQ
ID NO:44).

Figure 43B is the amino acid sequence of mouse FATP4 protein (SEQ ID
NO:45).

Figures 44A, 44B, and 44C are the hsFATP1 DNA sequence (SEQ ID NO:46).
Coding region: 175-2115 (1941 nt).

Figure 45 is the hsFATP1 protein sequence (SEQ ID NO:47).

Figures 46A and 46B are the hsFATP2 DNA sequence (SEQ ID NO:48).
Coding region: 223-2085 (1863 nt).

Figure 47 is the hsFATP2 protein sequence (SEQ ID NO:49).

Figure 48 is the partial DNA sequence of hsFATP3 (SEQ ID NO:50). Coding
region: 1-993.

Figure 49 is the partial protein sequence of hsFATP3 (SEQ ID NO:51).

Figures 50A, 50B, and 50C are the hsFATP4 DNA sequence (SEQ ID NO:52).
Coding region: 208-2139 (1932 nt).

Figure 51 is the hsFATP4 protein sequence (SEQ ID NO:53).

Figure 52 is the hsFATP5 partial DNA sequence (SEQ ID NO:54). Coding
region: 1-1062.

Figure 53 is the hsFATP5 partial protein sequence (SEQ ID NO:55).

Figures 54A, 54B, and 54C are the hsFATP6 DNA sequence (SEQ ID NO:56).
Coding region: 643-2502 (1860 nt).

Figure 55 is the hsFATP6 protein sequence (SEQ ID NO:57).

Figures 56A, 56B, and 56C are the rnFATP1 DNA sequence (rn=Rattus
norvegicus; SEQ ID NO:58). Coding region: 75-2015 (1941 nt).

Figure 57 is the rnFATP1 protein sequence (SEQ ID NO:59).

Figures 58A, 58B, and 58C are the rnFATP2 DNA sequence (SEQ ID NO:60).
Coding region: 795-2657 (1863 nt).

Figure 59 is the rnFATP2 protein sequence (SEQ ID NO:61).

Figures 60A and 60B are the rnFATP4 partial DNA sequence (SEQ ID NO:62).
Coding region: 1-1218.

Figure 61 is the rnFATP4 partial DNA sequence (SEQ ID NO:63).

Figures 62A, 62B, and 62C are the mmFATP1 DNA sequence (SEQ ID NO:64).
Coding region: 1-1944.

Figure 63 is the mmFATP1 protein sequence (SEQ ID NO:65).

Figures 64A and 64B are the mmFATP2 DNA sequence (SEQ ID NO:66).
Coding region: 121-1992 (1872 nt).

Figure 65 is the mmFATP2 protein sequence (SEQ ID NO:67).

Figures 66A and 66B are the mmFATP3 partial DNA sequence (SEQ ID
NO:68). Coding region: 1-1830.

Figure 67 is the mmFATP3 partial protein sequence (SEQ ID NO:69).

Figures 68A, 68B, and 68C are the mmFATP4 DNA sequence (SEQ ID NO:70).
Coding region: 1-1932.

Figure 69 is the mmFATP4 protein sequence (SEQ ID NO:71).

Figures 70A and 70B are the mmFATP5 DNA sequence (SEQ ID NO:72).
Coding region: 60-2129.

Figure 71 is the mmFATP5 protein sequence (SEQ ID NO:73).

Figures 72A and 72B are the dmFATP partial DNA sequence (dm=Drosophila
melanogaster; SEQ ID NO:74). Coding region: 1-1773.

Figure 73 is the dmFATP partial protein sequence (SEQ ID NO:75).

Figure 74 is the drFATP partial DNA sequence (dr=Danio rerio, zebrafish; SEQ
ID NO:76). Coding region: 1-173.

Figure 75 is the drFATP partial protein sequence (SEQ ID NO:77).

Figures 76A and 76B are the ceFATPa DNA sequence (SEQ ID NO:78). Coding
region: 1-1953.

Figure 77 is the ceFATPa protein sequence (SEQ ID NO:79).

Figures 78A and 78B are the ceFATPb DNA sequence (SEQ ID NO:80).
Coding region: 1-1968.

Figure 79 is the ceFATPb protein sequence (SEQ ID NO:81).

Figures 80A and 80B are the chFATP DNA sequence (SEQ ID NO:82;
ch=Cochliobolus heterostrophus). Coding region: 1-1932.

Figure 81 is the chFATP protein sequence (SEQ ID NO:83).

Figure 82 is the anFATP partial protein sequence (an=Aspergillus nidulans;
SEQ ID NO:84). Coding region: 1-597.

Figure 83 is the anFATP partial protein sequence (SEQ ID NO:85).

Figure 84 is the mgFATP partial DNA sequence (mg=Magnaporthe grisea, rice
blast; SEQ ID NO:86). Coding region: 1-522.

Figure 85 is the mgFATP partial protein sequence (SEQ ID NO:87).

Figures 86A and 86B are the scFATP DNA sequence (SEQ ID NO:88). Coding
region: 1-1872.

Figure 87 is the scFATP protein sequence (SEQ ID NO:89).

Figures 88A and 88B are the mtFATP DNA sequence (SEQ ID NO:90).

Figure 89 is the mtFATP protein sequence (SEQ ID NO:91). Coding region:
1-1794.

Figure 90 is a consensus sequence of the FATP signature sequence (SEQ ID
NO:100), based on 23 independent sequences aligned in ClustalX. The height of the bar at
each amino acid residue position indicates the degree of conservation at that position.
Gaps have been inserted to maintain the strength of the alignment.

Figure 91 is a hydrophilicity plot for hsFATP2, made using the Kyte-Doolittle
method, averaging hydrophilicity values for 18 amino acid residues at a time.

Figure 92 is a hydrophilicity plot for the hsFATP3 partial protein, made using
the Kyte-Doolittle method, averaging hydrophilicity values for 18 amino acid residues
at a time.

Figure 93 is a hydrophilicity plot for the hsFATP5 partial protein, made using
the Kyte-Doolittle method, averaging hydrophilicity values for 18 amino acid residues
at a time.

Figures 94A and 94B are a representation of the DNA sequence (SEQ ID
NO:101) of the hsFATP5 gene, and the amino acid sequence (SEQ ID NO:102) of the
hsFATP5 protein.
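
For readers unfamiliar with the plots in Figures 28C, 29C, 35C and 91-93, the following is a minimal sketch of a Kyte-Doolittle profile averaged over 18 residues at a time. The scale values are the published Kyte-Doolittle hydropathy indices. The function names, the use of full (untrimmed) windows only, and the sign convention for "hydrophilicity" (taken here as the negated hydropathy) are assumptions for illustration, not details stated in this document.

```python
# Kyte-Doolittle hydropathy indices; positive values are hydrophobic.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def windowed_hydropathy(seq, window=18):
    """Mean hydropathy of every full window of `window` residues along seq."""
    values = [KD[aa] for aa in seq.upper()]  # KeyError on non-standard residues
    if window <= 0 or len(values) < window:
        return []
    total = sum(values[:window])
    profile = [total / window]
    # Slide the window one residue at a time, updating the running sum.
    for i in range(window, len(values)):
        total += values[i] - values[i - window]
        profile.append(total / window)
    return profile


def windowed_hydrophilicity(seq, window=18):
    """Assumed convention: hydrophilicity is the negated hydropathy."""
    return [-v for v in windowed_hydropathy(seq, window)]


if __name__ == "__main__":
    # Hypothetical toy peptide, not a FATP sequence.
    toy = "L" * 18 + "K" * 18
    for position, score in enumerate(windowed_hydropathy(toy), start=1):
        print(position, round(score, 2))
```

Sustained stretches of high mean hydropathy in such a profile are what is conventionally read as candidate membrane-spanning segments.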



DETAILED DESCRIPTION OF THE INVENTION 
As described herein, FATPs are a large evolutionarily conserved family of
proteins that mediate the transport of LCFAs into cells. The family includes proteins
which are conserved from mycobacteria to humans and exhibit very different expression
patterns in tissues. Specific embodiments described include FATPs from mice, humans,
nematodes, fungi and mycobacteria which have been shown to be functional LCFA
transporters. The term "fatty acid transport proteins" ("FATPs") as used herein, refers
to the proteins described herein as FATP1, FATP2, FATP3, FATP4, FATP5 and
FATP6, which have been described in one or more species of mammals, as well as
mtFATP, ceFATP, scFATP, anFATP, mgFATP, and chFATP, and other proteins
sharing at least about 50% amino acid sequence similarity, preferably at least about 60%
sequence similarity, more preferably at least about 70% sequence similarity, and still
more preferably, at least about 80% sequence similarity, and most preferably, at least
about 90% sequence similarity in the approximately 360 amino acid signature sequence.
The approximately 360 amino acid FATP signature sequence is shown in Figure 1. The
consensus sequence of the signature sequence is shown in Figure 90. The nomenclature
used herein to refer to FATPs includes a species-specific prefix (e.g., mm, Mus
musculus; hs or h, Homo sapiens or human; mt, M. tuberculosis; dm, D. melanogaster;
ce, C. elegans; sc, Saccharomyces cerevisiae) and a number such that mammalian
homologues in different species share the same number. For example, six human and
five mouse FATP genes which are expressed in a variety of tissues are described herein
and are referred to, respectively, as hsFATP1-hsFATP6 and mmFATP1-mmFATP5; for
example, hsFATP4 and mmFATP4 are the human and mouse orthologs.
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Expression patterns of human and mouse FATPs have been assessed and are 
described below. Briefly, results of these assessments show that FATP5 is a liver- 
specific gene. FATP2 is highly expressed in liver and kidney. Both of these proteins, 
as well as FATP4 and FATPs from nematodes and mycobacteria, have been shown to 
be functional LCFA transporters. Results have also shown that FATP4 mRNA is
present at high levels in epithelial cells of two regions of the small intestine (the 
jejunum and ileum) and at lower, but significant, levels in a third region (the 
duodenum). They further showed that FATP2 mRNA is present in epithelial cells of the 
duodenum at a level similar to that of FATP4 mRNA, but is present at lower
levels in the jejunum and ileum. FATP4 mRNA was absent from other cell types of the
small intestine and no FATP4 mRNA could be detected in any cells of the colon. No
signals above background could be detected for FATP1, FATP3 and FATP5 in any of
the intestinal tissues. Thus, FATP4 is the major FATP in the mouse small intestine, 
which supports a major role for FATP4 (along with FATP2 to a lesser extent) in
absorption of free fatty acids. hsFATP4 was clearly expressed in the jejunum and
ileum; expression was absent in the stomach. This, too, is consistent with a major role
for FATP4 in absorption of fatty acids in the human gut. Analysis of FATP expression 
in human tissues, also described in detail below, showed that hsFATP6, which has no 
mouse ortholog as yet, is expressed at high levels in the heart and at low levels in the
placenta, but is undetectable in the other tissues assessed (Example 9). This is
consistent with a major role for FATP6 in absorption of fatty acids in the heart.

Long chain fatty acids (LCFAs) are an important energy source for pro- and 
eukaryotes and are involved in diverse cellular processes, such as membrane synthesis, 
intracellular signaling, protein modification, and transcriptional regulation. In
developed Western countries, human dietary lipids are mainly di- and triglycerides and
account for approximately 40% of caloric intake (Weisburger, J. H. (1997) J. Am. Diet.
Assoc. 97:S16-S23). These lipids are broken down into fatty acids and glycerol by 
pancreatic lipases in the small intestine (Chapus, C., Rovery, M., Sarda, L. & Verger, R.
(1988) Biochimie 70:1223-34); LCFAs are then transported into brush border cells,
where the majority is re-esterified and secreted into the lymphatic system as 
chylomicrons (Green, P.H. & Riley, J.W. (1981) Aust. N.Z.J. Med. 77:84-90). Fatty 
acids are liberated from lipoproteins by the enzyme lipoprotein lipase, which is bound to 
the luminal side of endothelial cells (Scow, R.O. & Blanchette-Mackie, E.J. (1992) Mol.
Cell. Biochem. 116:181-191). "Free" fatty acids in the circulation are bound to serum
albumin (Spector, A. A. (1984) Clin. Physiol. Biochem. 2:123-134) and are rapidly
incorporated by adipocytes, hepatocytes, and cardiac muscle cells. The latter derive 60-
90% of their energy through the oxidation of LCFAs (Neely, J.F., Rovetto, M.T. &
Oram, J.F. (1972) Prog. Cardiovasc. Dis. 75:289-329). Although saturable and specific
uptake of LCFAs has been demonstrated for intestinal cells, hepatocytes, cardiac 
myocytes, and adipocytes, the molecular mechanisms of LCFA transport across the 
plasma membrane have remained controversial (Hui, T.Y. & Bernlohr, D.A. (1997)
Front. Biosci. 75:d222-31-d231; Schaffer, J.E. & Lodish, H.F. (1995) Trends
Cardiovasc. Med. 5:218-224). Described herein is a large family of highly homologous
mammalian LCFA transporters which show wide expression, including in all tissues 
relevant to fatty acid metabolism. Further described are novel members of this family 
in other species, including mycobacterial and nematode FATPs which, like their 
mammalian counterparts, are functional fatty acid transporters. 

The discovery of a diverse but highly homologous family of FATPs is
reminiscent of the glucose transporter family. In a manner similar to the FATPs, the
glucose transporters have very divergent patterns of tissue expression (McGowan, K.M.,
Long, S.D. & Pekala, P.H. (1995) Pharmacol. Ther. 66:465-505). The FATPs, like
glucose transporters, may also differ in their substrate specificities, uptake kinetics, and
hormonal regulation (Thorens, B. (1996) Am. J. Physiol. 270:G541-G553). Indeed, the
levels of fatty acids in the blood, like those of glucose, can be regulated by insulin and 
are dysregulated in diseases such as noninsulin-dependent diabetes and obesity (Boden, 
G. (1997) Diabetes 46:3-10). The underlying mechanisms for the regulation of free
fatty acid concentrations in the blood are not understood, but could be explained by
hormonal modulation of FATPs. 

Insulin resistance is thought to be the major defect in non-insulin-dependent
diabetes mellitus (NIDDM) and is one of the earliest manifestations of NIDDM
(McGarry (1992) Science 258:766-770). Free fatty acids (FFAs) may provide an
explanation for why obesity is a risk factor for NIDDM. Plasma levels of FFAs are
elevated in diabetic patients (Reaven et al. (1988) Diabetes 37:1020). Elevated plasma
FFAs have been demonstrated to induce insulin resistance in whole
animals and humans (Boden (1998) Front. Biosci. 3:D169-D175). This insulin
resistance is likely mediated by effects of FFAs on a variety of tissues. FFAs added to
adipocytes in vitro induce insulin resistance in this cell type, as evidenced by inhibition
of insulin-induced glucose transport (Van Epps-Fung et al. (1997) Endocrinology
138:4338-4345). Rats fed a high fat diet developed skeletal muscle insulin resistance, as
evidenced by a decrease in insulin-induced glucose uptake by skeletal muscle (Han et
al. (1997) Diabetes 46:1761-1767). In addition, elevated plasma FFAs increase
insulin-suppressed endogenous glucose production in the liver (Boden (1998) Front.
Biosci. 3:D169-D175), thus increasing hepatic glucose output. It has been postulated
that the adverse effects of plasma free fatty acids are due to the FFAs being taken up
into the cell, leading to an increase in intracellular long chain fatty acyl CoA;
intracellular long chain acyl CoAs are thought to mediate the effects of FFAs inside the
cell. Thus, fatty acid induced insulin resistance may be prevented by blocking uptake
of FFAs into select tissues, in particular liver (by blocking FATP2 and/or FATP5),
adipocyte (by blocking FATP1), and skeletal muscle (by blocking FATP1). Blocking
intestinal fat absorption (by blocking FATP4) is also expected to reduce plasma FFA
levels and thus ameliorate insulin resistance.

During the pathogenesis of NIDDM, insulin resistance can initially be
counteracted by increasing insulin output by the pancreatic beta cell. Ultimately, this
compensation fails, beta cell function decreases and overt diabetes results (McGarry
(1992) Science 258:766-770). Manipulating beta cell function is a second point where
fatty acid transporter blockers may be beneficial for diabetes. While no FATP homolog
has been identified so far that is expressed in the beta cell of the pancreas, the data
described below suggest the existence of such a transporter, and the sequence
information included herein provides the means to identify such a transporter by
degenerate PCR, using primers to regions conserved in all FATP family members, or by
low stringency hybridization. It has been demonstrated that exposure of pancreatic
beta cells to FFAs increases the basal rate of insulin secretion; this in turn leads to a
decrease in the intracellular stores of insulin, resulting in decreased capacity for insulin
secretion after chronic exposure (Bollheimer et al. (1998) J. Clin. Invest. 101:1094-
1101). The effects of FFAs are again likely to be mediated by intracellular long chain
fatty acyl CoA molecules (Liu et al. (1998) J. Clin. Invest. 101:1870-1875). FFAs
have also been demonstrated to increase beta cell apoptosis (Shimabukuro et al. (1998)
Proc. Natl. Acad. Sci. USA 95:2498-2502), possibly contributing to the decrease in beta
cell numbers in late stage NIDDM.

Another finding with potentially broad implications is the identification of a
FATP homologue in M. tuberculosis. Tuberculosis causes more deaths worldwide than
any other infectious agent and drug-resistant tuberculosis is re-emerging as a problem in
industrialized nations (Bloom, B.R. & Small, P.M. (1998) N. Engl. J. Med. 338:611-
678). Mycobacterium tuberculosis has about 250 enzymes involved in fatty acid
metabolism, compared with only about 50 in E. coli. It has been suggested that, living
as a pathogen, the mycobacteria are largely lipolytic, rather than lipogenic, relying on
the lipids within mammalian cells and the tubercle (Cole, S.T. et al., Nature 393:531-
544 (1998)). The de novo synthesis of fatty acids in Mycobacterium leprae is
insufficient to maintain growth (Wheeler, P.R., Bulmer, K. & Ratledge, C. (1990) J.
Gen. Microbiol. 130:211-217). Thus, it is reasonable to expect that inhibitors of
mtFATP will serve as therapeutics for tuberculosis. FATPs expressed in mycobacteria
can be targeted to reduce or prevent replication of mycobacteria (e.g., to reduce or
prevent replication of M. tuberculosis) and, thus, reduce or prevent their adverse effects.
For example, a FATP or FATPs expressed by M. tuberculosis can be targeted and
inhibited, thus reducing or preventing growth of this pathogen (and tuberculosis in
humans and other mammals). An inhibitor of an M. tuberculosis FATP can be
identified using methods described herein (e.g., expressing the FATP in an appropriate
host cell, such as E. coli or COS cells; contacting the cells with an agent or drug to be
assessed for its ability to inhibit the FATP and, as a result, mycobacterial growth; and
assessing its effects on growth). A drug or agent identified in this manner can be further
tested for its ability to inhibit an M. tuberculosis FATP and M. tuberculosis infection in
an appropriate animal model or in humans. A method of inhibiting mycobacterial
growth, particularly growth of M. tuberculosis, and compounds useful as drugs for
doing so are also the subject of this invention.

An isolated polynucleotide encoding mtFATP, like other polynucleotides
encoding FATPs of the FATP family, can be incorporated into vectors, nucleic acids of
viruses, and other nucleic acid constructs that can be used in various types of host cells
to produce mtFATP. This mtFATP can be used, as it appears on the surface of cells or
in various artificial membrane systems, to assess fatty acid transport function and to
identify ligands and molecules that are modulators of fatty acid transport activity.
Molecules found to be inhibitors of mtFATP function can be incorporated into
pharmaceutical compositions to administer to a human for the treatment of tuberculosis.

Particular embodiments of the invention are polynucleotides encoding a FATP
of Cochliobolus (Helminthosporium) heterostrophus or portions or variants thereof, the
isolated or recombinantly produced FATP, methods for assessing whether an agent
binds to the chFATP, and further methods for assessing the effect of an agent being
tested for its ability to modulate fatty acid transport activity. Cochliobolus
heterostrophus is an ascomycete that is the cause of southern corn leaf blight, an
economically important threat to the corn crop in the United States. The related species
C. sativus causes crown rot and common root rot in wheat and barley. One or more
FATPs of C. heterostrophus can be targeted for the identification of an inhibitor of
chFATP function, which can then be used as an agent effective against infection of
plants by C. heterostrophus and related organisms. Methods described herein that were
applied in studying the expression of a FATP gene and the function of the FATP in its
natural site of expression or in a host cell can be used in the study of the chFATP gene
and protein.

Magnaporthe grisea (rice blast) is an economically important fungal pathogen of 
rice. Further embodiments of the invention are nucleic acid molecules encoding a 
FATP of Magnaporthe grisea, portions thereof, or variants thereof, isolated mgFATP,
nucleic acid constructs, and engineered cells expressing mgFATP. Other aspects of the
invention are assays to identify an agent which binds to mgFATP and assays to identify
an agent which modulates the function of mgFATP in cells in which mgFATP is
expressed or in artificial membrane systems. Agents identified as inhibiting mgFATP
activity can be developed into anti-fungal agents to be used to treat rice infected with
rice blast.

Caenorhabditis elegans is a nematode related to plant pathogens and human 
parasites. An isolated polynucleotide which encodes ceFATP, like other 
polynucleotides encoding FATPs of the FATP family described herein, can be 
incorporated into nucleic acid vectors and other constructs that can be used in various
types of cells to produce ceFATP. ceFATP, as it occurs in cells or as it can be isolated
or incorporated into various artificial or reconstructed membrane systems, can be used
to assess fatty acid transport, and to identify ligands and agents that modulate fatty acid
transport activity. Agents found by such assays to be inhibitors of ceFATP activity can
be incorporated into compositions for the treatment of diseases caused by genetically
related organisms with a FATP of similar sensitivity to the agents.

Aspergillus nidulans is one of a family of fungal species that can infect humans. 
Further embodiments of the invention of the family of polynucleotides encoding FATPs 
are polynucleotides encoding a FATP of Aspergillus nidulans, and vectors and host
cells that can be constructed to comprise such polynucleotides. Further embodiments
are a polypeptide encoded by such polynucleotides, portions thereof having one or more
functions characteristic of a FATP, and various methods. The methods include those
for identifying agents that bind to anFATP and those for assessing the effect of an agent
being tested for its ability to modulate fatty acid transport activity. Those agents found
to inhibit fatty acid transport function can be used in compositions as anti-fungal 
pharmaceuticals, or can be modified for greater effectiveness as a pharmaceutical. 

One aspect of the invention relates to isolated nucleic acids that encode a FATP 
as described herein, such as those FATPs having an amino acid sequence in Figure 45
(SEQ ID NO:47), Figure 47 (SEQ ID NO:49), Figure 49 (SEQ ID NO:51), Figure 51
(SEQ ID NO:53), Figures 94A and 94B (SEQ ID NO:102), and Figure 55 (SEQ ID
NO:57), and nucleic acids closely related thereto as described herein.

Using the information provided herein, such as a nucleic acid sequence set forth
in Figures 44A-44C (SEQ ID NO:46), Figures 46A and 46B (SEQ ID NO:48), Figure
48 (SEQ ID NO:50), Figures 50A-50C (SEQ ID NO:52), Figures 94A and 94B (SEQ
ID NO:101), and Figures 54A-54C (SEQ ID NO:56), a nucleic acid of the invention
encoding a FATP polypeptide may be obtained using standard cloning and screening
methods, such as those for cloning and sequencing cDNA library fragments, followed
by obtaining a full length clone. For example, to obtain a nucleic acid of the invention,
a library of human or other mammalian cDNA clones can be probed with a
labeled oligonucleotide, such as a radiolabeled oligonucleotide, preferably about 17
nucleotides or longer, derived from a partial sequence. Clones carrying DNA identical
to that of the probe can then be distinguished using stringent (also, "high stringency")
hybridization conditions. By sequencing the individual clones thus identified with
sequencing primers designed from the original sequence it is then possible to extend the
sequence in both directions to determine the full length sequence. Suitable techniques
are described, for example, in Current Protocols in Molecular Biology (F.M. Ausubel et
al., eds), containing supplements through Supplement 42, 1998, John Wiley and Sons,
Inc., especially chapters 5, 6 and 7.

Embodiments of the invention include isolated nucleic acid molecules
comprising any of the following nucleotide sequences: 1.) a nucleotide sequence which
encodes a protein comprising the amino acid sequence of hsFATP1 (SEQ ID NO:47),
the amino acid sequence of hsFATP2 (SEQ ID NO:49), the amino acid sequence of
hsFATP3 (SEQ ID NO:51), the amino acid sequence of hsFATP4 (SEQ ID NO:53), the
amino acid sequence of hsFATP5 (SEQ ID NO:102) or the amino acid sequence of
hsFATP6 (SEQ ID NO:57); 2.) nucleotide sequences of hsFATP1, hsFATP2,
hsFATP3, hsFATP4, hsFATP5, or hsFATP6 (SEQ ID NO:46, 48, 50, 52, 101, or 56,
respectively); 3.) a nucleotide sequence which is complementary to the nucleotide
sequence of hsFATP1 (SEQ ID NO:46), hsFATP2 (SEQ ID NO:48), hsFATP3 (SEQ
ID NO:50), hsFATP4 (SEQ ID NO:52), hsFATP5 (SEQ ID NO:101) or hsFATP6 (SEQ
ID NO:56); 4.) a nucleotide sequence which consists of the coding region of hsFATP1
(SEQ ID NO:46), the coding region of hsFATP2 (SEQ ID NO:48), the coding region of
hsFATP3 (SEQ ID NO:50), the coding region of hsFATP4 (SEQ ID NO:52), the coding
region of hsFATP5 (SEQ ID NO:101), or the coding region of hsFATP6 (SEQ ID
NO:56).

The invention further relates to nucleic acids (nucleic acid molecules or
polynucleotides) having nucleotide sequences identical over their entire length to those
shown in the figures, for instance Figures 44A-44C (SEQ ID NO:46), Figures 46A and
46B (SEQ ID NO:48), Figure 48 (SEQ ID NO:50), Figures 50A-50C (SEQ ID NO:52),
Figures 94A and 94B (SEQ ID NO:101), and Figures 54A-54C (SEQ ID NO:56). It
further relates to DNA which, due to the degeneracy of the genetic code, encodes a
FATP encoded by one of the FATP-encoding DNAs whose amino acid sequence is
provided herein. Also provided by the invention are nucleic acids having the coding
sequences for the mature polypeptides or fragments in reading frame with other coding
sequences, such as those encoding a leader or secretory sequence, or a pre-, pro- or
prepro- protein sequence. The nucleic acids of the invention encompass nucleic acids
that include a single continuous region or discontinuous regions encoding the
polypeptide, together with additional regions that may also contain coding or
non-coding sequences. The nucleic acids may also contain non-coding sequences, including,
for example, but not limited to, non-coding 5' and 3' sequences, such as the transcribed,
non-translated sequences, termination signals, ribosome binding sites, sequences that
stabilize mRNA, introns, polyadenylation signals, and additional coding sequences
which encode additional amino acids. For example, a marker sequence that facilitates
purification of the fused polypeptide can be encoded. In certain embodiments of the
invention, the marker sequence can be a hexa-histidine peptide, as provided in the pQE
vector (Qiagen, Inc.) and described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-
824 (1989), or an HA tag (Wilson et al., Cell 37:767 (1984)), or a sequence encoding
glutathione S-transferase of Schistosoma japonicum (vectors available from Pharmacia;
see Smith, D.B. and Johnson K.S., Gene 57:31 (1988) and Kaelin, W.G. et al., Cell
70:351 (1992)). Nucleic acids of the invention also include, but are not limited to,
nucleic acids comprising a structural gene and its naturally associated sequences that 
control gene expression. 
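
The reference above to degeneracy of the genetic code can be illustrated with a short sketch showing that two different DNA sequences may encode the same peptide. It assumes the standard genetic code; the function name `translate` and the two 9-nucleotide sequences are hypothetical examples and are not drawn from any SEQ ID NO herein.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence, reading complete codons from position 0."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Two hypothetical sequences that differ at the nucleotide level
# but encode the same tripeptide (Met-Leu-Ser).
seq_a = "ATGCTGTCC"
seq_b = "ATGTTAAGT"
print(translate(seq_a), translate(seq_b))

# Number of codons specifying leucine in the standard code.
leucine_codons = sorted(c for c, aa in CODON_TABLE.items() if aa == "L")
print(leucine_codons)
```

Because most amino acids are specified by more than one codon, many distinct DNA sequences fall within the class of nucleic acids encoding a given FATP amino acid sequence.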

The invention further relates to variants, including naturally-occurring allelic
variants, of those nucleic acids described specifically herein by DNA sequence, that
encode variants of such polypeptides as those having the amino acid sequences shown
in Figure 45 (SEQ ID NO:47), Figure 47 (SEQ ID NO:49), Figure 49 (SEQ ID NO:51),
Figure 51 (SEQ ID NO:53), Figures 94A and 94B (SEQ ID NO:102), or Figure 55 (SEQ
ID NO:57). Such variants include nucleic acids encoding variants of the above-listed
amino acid sequences, wherein those variants have several, such as 5 to 10, 1 to 5, or 3,
2 or 1 amino acids substituted, deleted, or added, in any combination. Variants include
polynucleotides encoding polypeptides with at least 95% but less than 100% amino acid
sequence identity to the polypeptides described herein by amino acid sequence. Variant
polynucleotides hybridize, under low to high stringency conditions, to the alleles
described herein by DNA sequence. In one embodiment, variants have silent
substitutions, additions and deletions that do not alter the properties and activities of the
FATP. Allelic variants of the polynucleotides encoding hsFATP1 (Figure 45; SEQ ID
NO:47), hsFATP2 (Figure 47; SEQ ID NO:49), hsFATP3 (Figure 49; SEQ ID NO:51),
hsFATP4 (Figure 51; SEQ ID NO:53), hsFATP5 (Figures 94A and 94B; SEQ ID NO:102) and
hsFATP6 (Figure 55; SEQ ID NO:57) will be identified as mapping to chromosomal
locations listed for the corresponding wild type genes in Table 2 in Example 1.

Orthologous genes are gene loci in different species that are sufficiently similar 
to each other in their nucleotide sequences to suggest that they originated from a 
common ancestral gene. Orthologous genes arise when a lineage splits into two species,
rather than when a gene is duplicated within a genome. Proteins that are orthologs are 
encoded by genes of two different species, wherein the genes are said to be orthologous. 

The invention further relates to polynucleotides encoding polypeptides which
are orthologous to those polypeptides having a specific amino acid sequence described
herein, such as the amino acid sequences shown in Figure 45 (SEQ ID NO:47), Figure
47 (SEQ ID NO:49), Figure 49 (SEQ ID NO:51), Figure 51 (SEQ ID NO:53), Figures
94A and 94B (SEQ ID NO:102), or Figure 55 (SEQ ID NO:57). These polynucleotides,
which can be called ortholog polynucleotides, encode orthologous polypeptides that can
range in amino acid sequence identity to a reference amino acid sequence described
herein from about 65% to less than 100%, but preferably 70% to 80%, more preferably
80% to 90%, and still more preferably 90% to less than 100%. Orthologous
polypeptides can also be those polypeptides that range in amino acid sequence similarity
to a reference amino acid sequence described herein from about 75% to 100%, within
the signature sequence. The amino acid sequence similarity between the signature
sequences of orthologous polypeptides is preferably 80%, more preferably 90%, and
still more preferably 95%. The ortholog polynucleotides encode polypeptides that have
similar functional characteristics (e.g., fatty acid transport activity) and similar tissue
distribution, as appropriate to the organism from which the ortholog polynucleotides
can be isolated.

Ortholog polynucleotides can be isolated from (e.g., by cloning or nucleic acid 
amplification methods) a great number of species, as shown by the sample of FATPs 
from evolutionarily divergent species described herein (see, e.g., Figures 44A-C through
Figure 89). Ortholog polynucleotides corresponding to those in Figure 45 (SEQ ID
NO:47), Figure 47 (SEQ ID NO:49), Figure 49 (SEQ ID NO:51), Figure 51 (SEQ ID
NO:53), Figures 94A and 94B (SEQ ID NO:102) and Figure 55 (SEQ ID NO:57) are
those which can be isolated from mammals such as rat, dog, chimpanzee, monkey,
baboon, pig, rabbit and guinea pig, for example.

Further variants that are fragments of the nucleic acids of the invention may be 
used to synthesize full-length nucleic acids of the invention, such as by use as primers in 
a polymerase chain reaction. As used herein, the term primer refers to a single-stranded 
oligonucleotide which acts as a point of initiation of template-directed DNA synthesis
under appropriate conditions (e.g., in the presence of four different nucleoside
triphosphates and an agent for polymerization, such as DNA or RNA polymerase or
reverse transcriptase) in an appropriate buffer and at a suitable temperature. The
appropriate length of a primer depends on the intended use of the primer, but typically
ranges from 15 to 30 nucleotides. Short primer molecules generally require cooler
temperatures to form sufficiently stable hybrid complexes with the template. A primer
need not reflect the exact sequence of the template, but must be sufficiently
complementary to hybridize with a template. The term primer site refers to the area of
the target DNA to which a primer hybridizes. The term primer pair refers to a set of
primers including a 5' (upstream) primer that hybridizes with the 5' end of the DNA
sequence to be amplified and a 3' (downstream) primer that hybridizes with the
complement of the 3' end of the sequence to be amplified. 
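
The primer pair definition above can be sketched in code. This is a minimal illustration under stated assumptions: the target is given as the 5'-to-3' sequence of one strand, the upstream primer is taken as the first bases of that strand, and the downstream primer as the reverse complement of its last bases, which is the usual laboratory convention. The function names and the default length of 20 nucleotides are hypothetical choices within the 15 to 30 nucleotide range stated above.

```python
# Translation table for Watson-Crick complements of DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def primer_pair(target, length=20):
    """Return (upstream, downstream) primers, each written 5' to 3'.

    target: sequence to be amplified, 5' to 3' on one strand.
    length: primer length; restricted here to the typical 15-30 nucleotide range.
    """
    target = target.upper()
    if not 15 <= length <= 30:
        raise ValueError("primer length should be 15 to 30 nucleotides")
    if len(target) < 2 * length:
        raise ValueError("target too short for two non-overlapping primers")
    upstream = target[:length]                         # matches the 5' end
    downstream = reverse_complement(target[-length:])  # anneals at the 3' end
    return upstream, downstream


# Hypothetical 40-nucleotide target, used only to exercise the function.
demo_target = "ATGGCTAGCAAGGATCCGTTCGATTACGCGTTAACCGGTA"
print(primer_pair(demo_target, length=15))
```

The sketch selects primers by position only; it does not check melting temperature, self-complementarity, or specificity, which a practical primer design would also consider.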

Further embodiments of the invention are nucleic acids that are at least 80% 
identical over their entire length to a nucleic acid described herein, for example a
nucleic acid having the nucleotide sequence in Figures 44A-44C (SEQ ID NO:46),
Figures 46A-46B (SEQ ID NO:48), Figure 48 (SEQ ID NO:50), Figures 50A-50C
(SEQ ID NO:52), Figures 94A and 94B (SEQ ID NO:101), and Figures 54A-54C (SEQ
ID NO:56). Additional embodiments are nucleic acids, and the complements of such
nucleic acids, having at least 90% nucleotide sequence identity to the above-described
sequences, and nucleic acids having at least 95% nucleotide sequence identity. In 
preferred embodiments, DNA of the present invention has 97% nucleotide sequence 
identity, 98% nucleotide sequence identity, or at least 99% nucleotide sequence identity 
with the DNA whose sequences are presented herein. 

Other embodiments of the invention are nucleic acids that are at least 80%
identical in nucleotide sequence to a nucleic acid encoding a polypeptide having an
amino acid sequence as set forth in Figure 45 (SEQ ID NO:47), Figure 47 (SEQ ID
NO:49), Figure 49 (SEQ ID NO:51), Figure 51 (SEQ ID NO:53), Figures 94A and 94B
(SEQ ID NO:102) or Figure 55 (SEQ ID NO:57), or as such amino acid sequences are
set forth elsewhere herein, and nucleic acids that are complementary to such nucleic
acids. Specific embodiments are nucleic acids having at least 90% nucleotide sequence
identity to a nucleic acid encoding a polypeptide having an amino acid sequence as 
described in the list above, nucleic acids having at least 95% sequence identity, and 
nucleic acids having at least 97% sequence identity. 
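
The percent identity thresholds recited above (80%, 90%, 95%, 97%) can be illustrated with a short sketch. It assumes the two sequences have already been aligned to equal length with "-" marking gaps, and it counts identity over all alignment columns in which at least one sequence has a residue; other definitions of the denominator exist and would give somewhat different values. The function names and example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned sequences of equal length.

    Columns where both sequences have a gap are ignored; a gap opposite
    a residue counts as a mismatch (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_threshold(identity, threshold=80.0):
    """True if the identity value is at least the stated threshold."""
    return identity >= threshold


# Hypothetical aligned 10-mers differing at one position.
identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
print(identity, meets_threshold(identity, 80.0), meets_threshold(identity, 95.0))
```

The same function applies to aligned amino acid sequences; sequence similarity, as opposed to identity, would additionally require a substitution scoring scheme, which is not shown here.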

The terms "complementary" or "complementarity" as used herein, refer to the
natural binding of polynucleotides under permissive salt and temperature conditions by
base-pairing. Complementarity between two single-stranded molecules may be
"partial", in which only some of the bases pair, or it may be complete, when total
complementarity exists between the single-stranded molecules (that is, when A-T and
G-C base pairing is 100% complete). The degree of complementarity between nucleic
acid strands has significant effects on the efficiency and strength of hybridization 
between nucleic acid strands. This is of particular importance in amplification 
reactions, which depend on binding between nucleic acid strands.

The invention further includes nucleic acids that hybridize to the above-described
nucleic acids, especially those nucleic acids that hybridize under stringent
hybridization conditions. "Stringent hybridization conditions" or "high stringency
conditions" generally occur within a range from about Tm minus 5°C (5°C below the
strand dissociation temperature or melting temperature (Tm) of the probe nucleic acid
molecule) to about 20°C to 25°C below Tm. As will be understood by those of skill in
the art, the stringency of hybridization may be altered in order to identify or detect
molecules having identical or related polynucleotide sequences. An example of high
stringency hybridization follows. Hybridization solution is (6x SSC/10 mM
EDTA/0.5% SDS/5x Denhardt's solution/100 ng/ml sheared and denatured salmon
sperm DNA). Hybridization is at 64-65°C for 16 hours. The hybridized blot is washed
two times with 2x SSC/0.5% SDS solution at room temperature for 15 minutes each,
and two times with 0.2x SSC/0.5% SDS at 65°C, for one hour each. Further examples
of high stringency conditions can be found on pages 2.10.1-2.10.16 (see particularly
2.10.8-11) and pages 6.3.1-6 in Current Protocols in Molecular Biology (Ausubel, F.M.
et al., eds., containing supplements up through Supplement 42, 1998). Examples of
high, medium, and low stringency conditions can be found on pages 36 and 37 of WO 
98/40404, which are incorporated herein by reference. 
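
The temperature range given above for stringent hybridization (from about 5°C below Tm to about 25°C below Tm) can be sketched numerically. No formula for Tm is given herein; the sketch assumes the Wallace rule (2°C for each A or T and 4°C for each G or C), which is only a rough approximation suited to short oligonucleotide probes. The function names and the 17-nucleotide probe are hypothetical.

```python
def wallace_tm(oligo):
    """Approximate melting temperature (degrees C) of a short oligonucleotide.

    Uses the Wallace rule, Tm = 2*(A+T) + 4*(G+C); this is an assumption of
    the sketch and is not a method stated in the text.
    """
    seq = oligo.upper()
    at_count = seq.count("A") + seq.count("T")
    gc_count = seq.count("G") + seq.count("C")
    return 2 * at_count + 4 * gc_count


def stringent_window(tm, upper_offset=5, lower_offset=25):
    """Return (lowest, highest) temperature of the stringent range below Tm."""
    return (tm - lower_offset, tm - upper_offset)


# Hypothetical 17-nucleotide probe (5 A, 4 T, 4 G, 4 C).
probe = "ATGCATGCATGCATGCA"
tm = wallace_tm(probe)
print(tm, stringent_window(tm))
```

For long probes, such as cDNA fragments hybridized at 64-65°C in the example above, Tm depends also on salt concentration and probe length, and the Wallace rule should not be relied upon.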

The invention further relates to nucleic acids obtainable by screening an
appropriate library with a probe having a nucleotide sequence such as that set forth in
Figures 44A-44C (SEQ ID NO:46), Figures 46A-46B (SEQ ID NO:48), Figure 48
(SEQ ID NO:50), Figures 50A-50C (SEQ ID NO:52), Figures 94A and 94B (SEQ ID
NO:101) or Figures 54A-54C (SEQ ID NO:56), or a probe which is a sufficiently long
fragment of any of the above; and isolating the nucleic acid. Such probes generally can
comprise at least 15 nucleotides. Nucleic acids obtainable by such screenings may
include RNAs, cDNAs and genomic DNA, for example, encoding FATPs of the FATP
family described herein.

Further uses for the nucleic acid molecules of the invention, whether encoding a
full-length FATP or whether comprising a contiguous portion of a nucleic acid molecule
such as one given in SEQ ID NO:46, 48, 50, 52, 101, or 56, include use as markers for
tissues in which the corresponding protein is preferentially expressed (to identify
constitutively expressed proteins or proteins produced at a particular stage of tissue
differentiation or stage of development of a disease state); as molecular weight markers
on Southern gels; as chromosome markers or tags (when labeled, for example with
biotin, a radioactive label or a fluorescent label) to identify chromosomes or to map
related gene positions; to compare with endogenous DNA sequences in a mammal to
identify potential genetic disorders; as probes to hybridize to, and thus identify, related
DNA sequences; as a source of information to derive PCR primers for genetic
fingerprinting; as a probe to "subtract-out" known sequences in the process of
discovering other novel nucleic acid molecules; for selecting and making oligomers for
attachment to a "gene chip" or other support, to be used, for example, for examination
of expression patterns; to raise anti-protein antibodies using DNA immunization
techniques; and as an antigen to raise anti-DNA antibodies or to elicit another immune
response.

Further methods to obtain nucleic acids encoding FATPs of the FATP family
include PCR and variations thereof (e.g., "RACE" PCR and semi-specific PCR
methods). Portions of the nucleic acids having a nucleotide sequence set forth in
Figures 44A-44C (SEQ ID NO:46), Figures 46A-46B (SEQ ID NO:48), Figure 48
(SEQ ID NO:50), Figures 50A-50C (SEQ ID NO:52), Figures 94A and 94B (SEQ ID
NO:101) or Figures 54A-54C (SEQ ID NO:56) (especially "flanking sequences" on
either side of a coding region) can be used as primers in methods using the polymerase
chain reaction, to produce DNA from an appropriate template nucleic acid.
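As an illustration only, the choice of a primer pair from the two ends of a known sequence can be sketched as follows. The function names, the default primer length of 18 nucleotides, and the example sequence are assumptions made for this sketch and do not come from the specification; practical primer design would also weigh melting temperature, GC content and secondary structure.

```python
from typing import Tuple


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def flanking_primers(template: str, primer_length: int = 18) -> Tuple[str, str]:
    """Pick a forward and a reverse primer from the two ends of a template.

    The forward primer is the first `primer_length` bases of the given strand.
    The reverse primer is the reverse complement of the last `primer_length`
    bases, so both primers are written 5'->3' and point toward each other.
    """
    if len(template) < 2 * primer_length:
        raise ValueError("template is too short for two non-overlapping primers")
    forward = template[:primer_length].upper()
    reverse = reverse_complement(template[-primer_length:])
    return forward, reverse


# Hypothetical 21-nucleotide example sequence, not taken from any SEQ ID NO.
forward, reverse = flanking_primers("ATGGCCAAATTTGGGCCCTAA", primer_length=5)
```

Here the forward primer anneals to the complementary strand at the 5' end of the region and the reverse primer anneals to the given strand at its 3' end, which is the arrangement needed for exponential amplification.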

Once a fragment of the FATP gene is generated by PCR, it can be sequenced,
and the sequence of the product can be compared to other DNA sequences, for example,
by using the BLAST Network Service at the National Center for Biotechnology
Information. The boundaries of the open reading frame can then be identified using
semi-specific PCR or other suitable methods such as library screening. Once the 5'
initiator methionine codon and the 3' stop codon have been identified, a PCR product
encoding the full-length gene can be generated using genomic DNA as a template, with
primers complementary to the extreme 5' and 3' ends of the gene or to their flanking
sequences. The full-length genes can then be cloned into expression vectors for the
production of functional proteins.
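Once sequence is in hand, locating a candidate initiator methionine codon and the in-frame stop codon can be sketched computationally. The following is a minimal sketch, assuming a single DNA strand, the standard stop codons TAA, TAG and TGA, and no introns; the function name and example sequence are hypothetical.

```python
from typing import Optional, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_open_reading_frame(dna: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) of the first ATG...stop reading frame on this strand.

    `start` is the index of the A of the initiator ATG and `end` is the index
    just past the stop codon. Returns None if no complete frame is found.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        # Walk codon by codon from the codon after the ATG.
        for pos in range(start + 3, len(dna) - 2, 3):
            if dna[pos:pos + 3] in STOP_CODONS:
                return start, pos + 3
        # No in-frame stop for this ATG; try the next ATG downstream.
        start = dna.find("ATG", start + 1)
    return None


# Hypothetical example: ATG GCC AAA TAG embedded in flanking bases.
orf = find_open_reading_frame("CCATGGCCAAATAGTT")
```

A real analysis would scan both strands and all frames and would usually prefer the longest frame; the sketch only reports the first complete one.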

The invention also relates to isolated proteins or polypeptides such as those
encoded by nucleic acids of the present invention. Isolated proteins can be purified
from a natural source or can be made recombinantly. Proteins or polypeptides referred
to herein as "isolated" are proteins or polypeptides that exist in a state different from the
state in which they exist in cells in which they are normally expressed in an organism,
and include proteins or polypeptides obtained by methods described herein, similar
methods or other suitable methods, and also include essentially pure proteins or
polypeptides, proteins or polypeptides produced by chemical synthesis or by
combinations of biological and chemical methods, and recombinant proteins or
polypeptides which are isolated. Thus, the term "isolated" as used herein, indicates that
the polypeptide in question exists in a physical milieu distinct from that in which it
occurs in nature. Thus, "isolated" includes existing in membrane fragments and vesicles,
membrane fractions, liposomes, lipid bilayers and other artificial membrane systems.
An isolated FATP may be substantially isolated with respect to the complex cellular
milieu in which it naturally occurs, and may even be purified essentially to
homogeneity, for example as determined by PAGE or column chromatography (for
example, HPLC), but may also have further cofactors or molecular stabilizers, such as
detergents, added to the purified protein to enhance activity. In one embodiment,
proteins or polypeptides are isolated to a state at least about 75% pure; more preferably
at least about 85% pure, and still more preferably at least about 95% pure, as determined
by Coomassie blue staining of proteins on SDS-polyacrylamide gels. Proteins or
polypeptides referred to herein as "recombinant" are proteins or polypeptides produced
by the expression of recombinant nucleic acids.

In a preferred embodiment, an isolated polypeptide comprising a FATP, a
functional portion thereof, or a functional equivalent of the FATP, has at least one
function characteristic of a FATP, for example, transport activity, binding function (e.g.,
a domain which binds to AMP), or antigenic function (e.g., binding of antibodies that
also bind to a naturally-occurring FATP, as that function is found in an antigenic
determinant). Functional equivalents can have activities that are quantitatively similar
to, greater than, or less than, the reference protein. These proteins include, for example,
naturally occurring FATPs that can be purified from tissues in which they are produced
(including polymorphic or allelic variants), variants (e.g., mutants) of those proteins
and/or portions thereof. Such variants include mutants differing by the addition,
deletion or substitution of one or more amino acid residues, or modified polypeptides in
which one or more residues are modified, and mutants comprising one or more modified
residues. Portions or fragments of a FATP can range in size from four amino acid
residues to the entire amino acid sequence minus one amino acid.

The isolated proteins of the invention preferably include mammalian fatty acid
transport proteins of the FATP family of homologous proteins. In one embodiment, the
extent of amino acid sequence similarity between a polypeptide having one of the amino
acid sequences shown in Figure 45 (SEQ ID NO:47), Figure 47 (SEQ ID NO:49),
Figure 49 (SEQ ID NO:51), Figure 51 (SEQ ID NO:53), Figures 94A and 94B (SEQ ID
NO:102), or Figure 55 (SEQ ID NO:57), and the respective functional equivalents of
these polypeptides is at least about 88%. In other embodiments, the degree of amino
acid sequence similarity between a FATP and its respective functional equivalent is at
least about 91%, at least about 94%, or at least about 97%.

The polypeptides of the invention also include those FATPs encoded by
polynucleotides which are orthologous to those polynucleotides, the sequences of which
are described herein in whole or in part. FATPs which are orthologs to those described
herein by amino acid sequence, in whole or in part, are, for example, fatty acid transport
proteins 1-6 of dog, rat, chimpanzee, monkey, rabbit, guinea pig, baboon and pig, and
are also embodiments of the invention.

To determine the percent identity or similarity of two amino acid sequences or
of two nucleic acid sequences, the sequences are aligned for optimal comparison
purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid
or nucleic acid sequence for optimal alignment, and non-homologous (dissimilar)
sequences can be disregarded for comparison purposes). In a preferred embodiment, the
length of a reference sequence aligned for comparison purposes is at least 30%,
preferably at least 40%, more preferably at least 50%, even more preferably at least
60%, and even more preferably at least 70%, 80%, or 90% of the length of the reference
sequence. The amino acid residues or nucleotides at corresponding amino acid
positions or nucleotide positions are then compared. When a position in the first
sequence is occupied by the same amino acid residue or nucleotide as the corresponding
position in the second sequence, then the molecules are identical at that position (as
used herein, amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic
acid "similarity"). The percent identity between the two sequences is a function of the
number of identical positions shared by the sequences, taking into account the number
of gaps, and the length of each gap, which need to be introduced for optimal alignment
of the two sequences.
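The position-by-position comparison described above can be sketched as a short calculation on two sequences that have already been aligned. This is an illustrative sketch only: it assumes gaps are written as "-" and that percent identity is taken over the alignment columns (so each gap position lowers the value), which is one common convention and not necessarily the one used by any particular program named in this specification.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Gaps are written as '-'. Columns in which both sequences carry a gap are
    ignored; every other column counts toward the denominator, so gap
    positions reduce the reported identity.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no comparable columns")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


# Hypothetical six-column alignment with one gap: five of six columns match.
identity = percent_identity("MKV-LA", "MKVALA")
```

Other conventions divide by the length of the shorter sequence or by the number of ungapped columns; the choice of denominator should be stated whenever a percent identity threshold is applied.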

The invention also encompasses polypeptides having a lower degree of identity
but having sufficient similarity so as to perform one or more of the same functions
performed by the polypeptides described herein by amino acid sequence. Similarity for
a polypeptide is determined by conserved amino acid substitution. Such substitutions
are those that substitute a given amino acid in a polypeptide by another amino acid of
like characteristics. Conservative substitutions are likely to be phenotypically silent.
Typically seen as conservative substitutions are the replacements, one for another,
among the aliphatic amino acids Ala, Val, Leu, and Ile; interchange of the hydroxyl
residues Ser and Thr; exchange of the acidic residues Asp and Glu; substitution between
the amide residues Asn and Gln; exchange of the basic residues Lys and Arg; and
replacements among the aromatic residues Phe and Tyr. Guidance concerning which amino
acid changes are likely to be phenotypically silent is found in Bowie et al., Science
247:1306-1310 (1990).

TABLE 1. Conservative Amino Acid Substitutions

Aromatic        Phenylalanine
                Tryptophan
                Tyrosine

Hydrophobic     Leucine
                Isoleucine
                Valine

Polar           Glutamine
                Asparagine

Basic           Arginine
                Lysine
                Histidine

Acidic          Aspartic Acid
                Glutamic Acid

Small           Alanine
                Serine
                Threonine
                Methionine
                Glycine
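The groupings of Table 1 lend themselves to a simple lookup. The sketch below encodes the six groups with one-letter amino acid codes and tests whether a replacement stays within a group; the dictionary and function names are illustrative assumptions, and the grouping follows Table 1 rather than the slightly different list given in the preceding paragraph.

```python
# Groups of Table 1, written with one-letter amino acid codes.
CONSERVATIVE_GROUPS = {
    "aromatic": {"F", "W", "Y"},
    "hydrophobic": {"L", "I", "V"},
    "polar": {"Q", "N"},
    "basic": {"R", "K", "H"},
    "acidic": {"D", "E"},
    "small": {"A", "S", "T", "M", "G"},
}


def is_conservative_substitution(original: str, replacement: str) -> bool:
    """Return True if both residues differ but fall in the same Table 1 group."""
    original, replacement = original.upper(), replacement.upper()
    if original == replacement:
        # An unchanged residue is not a substitution at all.
        return False
    return any(
        original in group and replacement in group
        for group in CONSERVATIVE_GROUPS.values()
    )
```

For example, `is_conservative_substitution("D", "E")` is true because both residues are acidic, whereas `is_conservative_substitution("L", "D")` is false. Residues absent from Table 1 (cysteine and proline) never qualify under this sketch.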





The comparison of sequences and determination of percent identity and
similarity between two sequences can be accomplished using a mathematical algorithm
(Computational Molecular Biology, Lesk, A.M., ed., Oxford University Press, New
York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed.,
Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part 1,
Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence
Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence
Analysis Primer, Gribskov, M. and Devereux, J., eds., M. Stockton
Press, New York, 1991). In a preferred embodiment, the percent identity between two
amino acid sequences is determined using the Needleman and Wunsch (J. Mol. Biol.
(48):444-453 (1970)) algorithm which has been incorporated into the GAP program in
the GCG software package (available at http://www.gcg.com), using either a Blosum
62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a
length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent
identity between two nucleotide sequences is determined using the GAP program in the
GCG software package (Devereux, J., et al., Nucleic Acids Res. 12(1):387 (1984))
(available at http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight
of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another
embodiment, the percent identity between two amino acid or nucleotide sequences is
determined using the algorithm of E. Meyers and W. Miller (CABIOS, 4:11-17 (1989))
which has been incorporated into the ALIGN program (version 2.0), using a PAM120
weight residue table, a gap length penalty of 12 and a gap penalty of 4.
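For orientation, the dynamic-programming idea behind the Needleman and Wunsch algorithm can be sketched in a few lines. This is a simplified illustration and not the GAP or ALIGN implementation: it assumes a flat match/mismatch score in place of a Blosum 62, PAM250 or PAM120 matrix, and a single linear per-position gap penalty in place of separate gap and length weights. All parameter values are placeholders.

```python
def needleman_wunsch_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -2) -> int:
    """Optimal global alignment score of sequences a and b.

    score[i][j] holds the best score for aligning the first i residues of a
    with the first j residues of b.
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    # Aligning a prefix against nothing costs one gap per residue.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + pair,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,       # gap in b
                score[i][j - 1] + gap,       # gap in a
            )
    return score[-1][-1]
```

Replacing the flat `pair` score with a lookup in a substitution matrix, and the linear gap term with separate opening and extension terms, gives the form of scoring that the gap weight and length weight parameters above refer to.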

The nucleic acid and protein sequences of the present invention can further be
used as a "query sequence" to perform a search against databases to, for example,
identify other family members or related sequences. Such searches can be performed
using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (J. Mol.
Biol. 215:403-10 (1990)). BLAST nucleotide searches can be performed with the
NBLAST program, score = 100, word length = 12 to obtain nucleotide sequences
homologous to (with calculatably significant similarity to) the nucleic acid molecules of
the invention. BLAST protein searches can be performed with the XBLAST program,
score = 50, word length = 3 to obtain amino acid sequences homologous to the proteins
of the invention. To obtain gapped alignments for comparison purposes, Gapped
BLAST can be utilized as described in Altschul et al. (Nucleic Acids Res. 25(17):3389-
3402 (1997)). When utilizing BLAST and gapped BLAST programs, the default
parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See
http://www.ncbi.nlm.nih.gov.

Similarity for nucleotide and amino acid sequences can be defined in terms of
the parameters set by the Advanced Blast search available from NCBI (the National
Center for Biotechnology Information; see, for Advanced BLAST page,
www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-newblast?Jform=1). These default
parameters, recommended for a query molecule of length greater than 85 amino acid
residues or nucleotides, have been set as follows: gap existence cost, 11; per residue gap
cost, 1; lambda ratio, 0.85. Further explanation of version 2.0 of BLAST can be found
on related website pages and in Altschul, S.F. et al., Nucleic Acids Res. 25:3389-3402
(1997).
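The two gap parameters combine into a single penalty per gap. As a small worked sketch, assuming the usual affine convention in which a gap of length k is charged the existence cost plus k times the per-residue cost (the convention is an assumption here, not stated in the text above):

```python
def affine_gap_cost(gap_length: int, existence_cost: int = 11,
                    per_residue_cost: int = 1) -> int:
    """Total penalty for one gap under an affine gap model.

    Defaults mirror the gap existence cost (11) and per residue gap cost (1)
    quoted above; a gap of length zero costs nothing.
    """
    if gap_length < 0:
        raise ValueError("gap length cannot be negative")
    if gap_length == 0:
        return 0
    return existence_cost + per_residue_cost * gap_length
```

Under these defaults a single-residue gap costs 12 and a three-residue gap costs 14, so one long gap is penalized far less than several short ones of the same total length.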

The invention further relates to fusion proteins, comprising a FATP or
functional portion thereof (as described above) as a first moiety, linked to a second
moiety not occurring in the FATP as found in nature. Thus, the second moiety can be
an amino acid, peptide or polypeptide. The first moiety can be in an N-terminal
location, C-terminal location or internal to the fusion protein. In one embodiment, the
fusion protein comprises a FATP as the first moiety, and a second moiety comprising a
linker sequence and an affinity ligand. Fusion proteins can be produced by a variety of
methods. For example, a fusion protein can be produced by the insertion of a FATP
gene or portion thereof into a suitable expression vector, such as Bluescript SK +/-
(Stratagene), pGEX-4T-2 (Pharmacia), pET-24(+) (Novagen), or vectors of similar
construction. The resulting construct can be introduced into a suitable host cell for
expression. Upon expression, fusion protein can be purified from cells by means of a
suitable affinity matrix (See e.g., Current Protocols in Molecular Biology, Ausubel,
F.M. et al., eds., Vol. 2, pp. 16.4.1-16.7.8, containing supplements up through
Supplement 42, 1998).

The invention also relates to enzymatically produced, synthetically produced, or
recombinantly produced portions of a fatty acid transport protein. Portions of a FATP
can be made which have full or partial function on their own, or which when mixed
together (though fully, partially, or nonfunctional alone), spontaneously assemble with
one or more other polypeptides to reconstitute a functional protein having at least one
function characteristic of a FATP.

Fragments of a FATP can be produced by direct peptide synthesis, for example
those using solid-phase techniques (Roberge, J.Y. et al., Science 269:202-204 (1995);
Merrifield, J., J. Am. Chem. Soc. 85:2149-2154 (1963)). Protein synthesis can be
performed using manual techniques or by automation. Automated synthesis can be
carried out using, for instance, an Applied Biosystems 431A Peptide Synthesizer
(Perkin Elmer). Various fragments of a FATP can be synthesized separately and
combined using chemical methods.

One aspect of the invention is a peptide or polypeptide having the amino acid
sequence of a portion of a fatty acid transport protein which is hydrophilic rather than
hydrophobic, and ordinarily can be detected as facing the outside of the cell membrane.
Such a peptide or polypeptide can be thought of as being an extracellular domain of the
FATP, or a mimetic of said extracellular domain. It is known, for example, that a
portion of human FATP4 that includes a highly conserved motif is involved in AMP-
CoA binding function (Stuhlsatz-Krouper, S.M. et al., J. Biol. Chem. 44:28642-28650
(1998)).

The term "mimetic" as used herein, refers to a molecule, the structure of which
is developed from knowledge of the structure of the FATP of interest, or one or more
portions thereof, and, as such, is able to effect some or all of the functions of a FATP.

Portions of a FATP can be prepared by enzymatic cleavage of the isolated
protein, or can be made by chemical synthesis methods. Portions of a FATP can also be
made by recombinant DNA methods in which restriction fragments, or fragments that
may have undergone further enzymatic processing, or synthetically made DNAs are
joined together to construct an altered FATP gene. The gene can be made such that it
encodes one or more desired portions of a FATP. These portions of FATP can be
entirely homologous to a known FATP, or can be altered in amino acid sequence
relative to naturally occurring FATPs to enhance or introduce desired properties such as
solubility, stability, or affinity to a ligand. A further feature of the gene can be a
sequence encoding an N-terminal signal peptide directed to the plasma membrane.

An extracellular domain can be determined by a hydrophobicity plot, such as
those shown in Figures 28A, 29A, and 35A, or by a hydrophilicity plot such as those
shown in Figures 28C, 29C, 35C, 91, 92 and 93. A polypeptide or peptide comprising
all or a portion of a FATP extracellular domain can be used in a pharmaceutical
composition. When administered to a mammal by an appropriate route, the polypeptide
or peptide can bind to fatty acids and compete with the native FATPs in the membrane
of cells, thereby making fewer fatty acid molecules available as substrates for transport
into cells, and reducing the amount of fatty acids taken up by, for example, the heart, in
the case of FATP6.
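A hydrophobicity plot of the kind referred to above is typically a sliding-window average of per-residue hydropathy values. The sketch below uses the Kyte-Doolittle hydropathy scale and a nine-residue window; both are assumptions made for illustration, since the text does not state which scale or window was used for the figures, and the function name and example peptide are hypothetical.

```python
from typing import List

# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 9) -> List[float]:
    """Mean hydropathy of each full window along a protein sequence.

    Element i of the result covers residues i .. i + window - 1.
    """
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    values = [KYTE_DOOLITTLE[residue] for residue in seq]
    return [
        sum(values[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


# Hypothetical peptide; a real analysis would use a full FATP sequence.
profile = hydropathy_profile("MKVLAAGIV", window=3)
```

Stretches with strongly positive window averages are candidates for membrane-spanning segments, while stretches with negative averages are hydrophilic and more likely to be exposed; assigning a hydrophilic stretch to the extracellular side needs additional evidence beyond the plot.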

Another aspect of the invention relates to a method of producing a fatty acid
transport protein, variants or portions thereof, and to expression systems and host cells
containing a vector appropriate for expression of a fatty acid transport protein.

Cells that express a FATP, a variant or a portion thereof, or an ortholog of a
FATP described herein by amino acid sequence, can be made and maintained in culture,
under conditions suitable for expression, to produce protein in the cells for cell-based
assays, or to produce protein for isolation. These cells can be procaryotic or eucaryotic.
Examples of procaryotic cells that can be used for expression include Escherichia coli,
Bacillus subtilis and other bacteria. Examples of eucaryotic cells that can be used for
expression include yeasts such as Saccharomyces cerevisiae, Schizosaccharomyces
pombe, Pichia pastoris and other lower eucaryotic cells, and cells of higher eucaryotes
such as those from insects and mammals, such as primary cells and cell lines such as
CHO, HeLa, 3T3 and BHK cells, preferably COS cells and human kidney 293 cells, and
more preferably Jurkat cells. (See, e.g., Ausubel, F.M. et al., eds. Current Protocols in
Molecular Biology, Greene Publishing Associates and John Wiley & Sons, Inc.,
containing Supplements up through Supplement 42, 1998.)

In one embodiment, host cells that produce a recombinant FATP, or a portion
thereof, a variant, or an ortholog of a FATP described herein by amino acid sequence,
can be made as follows. A gene encoding a FATP, variant or a portion thereof can be
inserted into a nucleic acid vector, e.g., a DNA vector, such as a plasmid, phage,
cosmid, phagemid, virus, virus-derived vector (e.g., SV40, vaccinia, adenovirus, fowl
pox virus, pseudorabies viruses, retroviruses) or other suitable replicon, which can be
present in a single copy or multiple copies, or the gene can be integrated in a host cell
chromosome. A suitable replicon or integrated gene can contain all or part of the
coding sequence for a FATP or variant, operably linked to one or more expression
control regions whereby the coding sequence is under the control of transcription
signals and linked to appropriate translation signals to permit translation. The vector
can be introduced into cells by a method appropriate to the type of host cells (e.g.,
transfection, electroporation, infection). For expression from the FATP gene, the host
cells can be maintained under appropriate conditions (e.g., in the presence of inducer,
normal growth conditions, etc.). Proteins or polypeptides thus produced can be
recovered (e.g., from the cells, as in a membrane fraction, from the periplasmic space of
bacteria, from culture medium) using suitable techniques. Appropriate membrane
targeting signals may be incorporated into the expressed polypeptide. These signals
may be endogenous to the polypeptide or they may be heterologous signals.

Polypeptides of the invention can be recovered and purified from cell cultures
(or from their primary cell source) by well-known methods including ammonium sulfate
or ethanol precipitation, acid extraction, anion or cation exchange chromatography,
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity
chromatography, hydroxylapatite chromatography and high performance liquid
chromatography. Known methods for refolding protein can be used to regenerate active
conformation if the polypeptide is denatured during isolation or purification.

A further aspect of the invention includes methods for assessing the transport
function of any of the fatty acid transport proteins or polypeptides described herein,
including orthologs, and, in variations of these, methods for identifying an inhibitor (or
an enhancer) of such function and methods for assessing the transport function in the
presence of a candidate inhibitor or a known inhibitor.

A variety of systems comprising living cells can be used for these methods.
Cells to be used in fatty acid transport assays, and further in methods for identifying an
inhibitor or enhancer of this function, express one or more FATPs. See Examples 3, 6,
9, 12 and 14 for data on tissue distribution of expression of FATPs, and Examples 10
and 11 describing recombinant cells expressing FATP. Cells for use in cell-based
assays described herein can be drawn from a variety of sources, such as isolated primary
cells of various organs and tissues wherein one or more FATPs are naturally expressed.
In some cases, the cells can be from adult organs, and in some cases, from embryonic or
fetal organs, such as heart, lung, liver, intestine, skeletal muscle, kidney and the like.
Cells for this purpose can also include cells cultured as fragments of organs or in
conditions simulating the cell type and/or tissue organization of organs, in which
artificial materials may be used as substrates for cell growth. Other types of cells
suitable for this purpose include cells of a cell strain or cell line (ordinarily comprising
cells considered to be "transformed") transfected to express one or more FATPs.

A further embodiment of the invention is a method for detecting, in a sample of
cells, a fatty acid transport protein, a portion or fragment thereof, a fusion protein
comprising a FATP or a portion thereof, or an ortholog as described herein, wherein the
cells can be, for instance, cells of a tissue, primary culture cells, or cells of a cell line,
including cells into which nucleic acid has been introduced. The method comprises
adding to the sample an agent that specifically binds to the protein, and detecting the
agent specifically bound to the protein. Appropriate washing steps can be added to
reduce nonspecific binding to the agent. The agent can be, for example, an antibody, a
ligand or a substrate mimic. The agent can have incorporated into it, or have bound to
it, covalently or by high affinity non-covalent interactions, for instance, a label that
facilitates detection of the agent to which it is bound, wherein the label can be, but is not
limited to, a phosphorescent label, a fluorescent label, a biotin or avidin label, or a
radioactive label. The means of detection of a fatty acid transport protein can vary, as
appropriate to the agent and label used. For example, for an antibody that binds to the
fatty acid transport protein, the means of detection may call for binding a second
antibody, which has been conjugated to an enzyme, to the antibody which binds the
fatty acid transport protein, and detecting the presence of the second antibody by means
of the enzymatic activity of the conjugated enzyme.

Similar principles can also be applied to a cell lysate or a more purified
preparation of proteins from cells that may comprise a fatty acid transport protein of
interest, for example in the methods of immunoprecipitation, immunoblotting, and
immunoaffinity methods that, in addition to detection of the particular FATP, can also
be used in purification steps, and in qualitative and quantitative immunoassays. See, for
instance, chapters 11 through 14 in Antibodies: A Laboratory Manual, E. Harlow and
D. Lane, eds., Cold Spring Harbor Laboratory, 1988.

Isolated fatty acid transport protein, or an antigenically similar portion thereof,
especially a portion that is soluble, can be used in a method to select and identify
molecules which bind specifically to the FATP. Fusion proteins comprising all of, or a
portion of, the fatty acid transport protein linked to a second moiety not occurring in the
FATP as found in nature, can be prepared for use in another embodiment of the method.
Suitable fusion proteins for this purpose include those in which the second moiety
comprises an affinity ligand (e.g., an enzyme, antigen, epitope). FATP fusion proteins
can be produced by the insertion of a gene encoding the FATP or a variant thereof, or a
suitable portion of such gene into a suitable expression vector, which encodes an
affinity ligand (e.g., pGEX-4T-2 and pET-15b, encoding glutathione S-transferase and
His-Tag affinity ligands, respectively). The expression vector can be introduced into a
suitable host cell for expression. Host cells are lysed and the lysate, containing fusion
protein, can be bound to a suitable affinity matrix by contacting the lysate with an
affinity matrix.

In one embodiment, the fusion protein can be immobilized on a suitable affinity
matrix under conditions sufficient to bind the affinity ligand portion of the fusion
protein to the matrix, and is contacted with one or more candidate binding agents (e.g., a
mixture of peptides) to be tested, under conditions suitable for binding of the binding
agents to the FATP portion of the bound fusion protein. Next, the affinity matrix with
bound fusion protein can be washed with a suitable wash buffer to remove unbound
candidate binding agents and non-specifically bound candidate binding agents. Those
agents which remain bound can be released by contacting the affinity matrix with fusion
protein bound thereto with a suitable elution buffer. Wash buffer can be formulated to
permit binding of the fusion protein to the affinity matrix, without significantly
disrupting binding of specifically bound binding agents. In this aspect, elution buffer
can be formulated to permit retention of the fusion protein by the affinity matrix, but
can be formulated to interfere with binding of the candidate binding agents to the target
portion of the fusion protein. For example, a change in the ionic strength or pH of the
elution buffer can lead to release of specifically bound agent, or the elution buffer can
comprise a release component or components designed to disrupt binding of specifically
bound agent to the target portion of the fusion protein.

Immobilization can be performed prior to, simultaneous with, or after,
contacting the fusion protein with candidate binding agent, as appropriate. Various
permutations of the method are possible, depending upon factors such as the candidate
molecules tested, the affinity matrix-ligand pair selected, and elution buffer formulation.
For example, after the wash step, fusion protein with binding agent molecules bound
thereto can be eluted from the affinity matrix with a suitable elution buffer (a matrix
elution buffer, such as glutathione for a GST fusion). Where the fusion protein
comprises a cleavable linker, such as a thrombin cleavage site, cleavage from the
affinity ligand can release a portion of the fusion with the candidate agent bound
thereto. Bound agent molecules can then be released from the fusion protein or its
cleavage product by an appropriate method, such as extraction.

One or more candidate binding agents can be tested simultaneously. Where a
mixture of candidate binding agents is tested, those found to bind by the foregoing
processes can be separated (as appropriate) and identified by suitable methods (e.g.,
PCR, sequencing, chromatography). Large libraries of candidate binding agents (e.g.,
peptides, RNA oligonucleotides) produced by combinatorial chemical synthesis or by
other methods can be tested (see e.g., Ohlmeyer, M.H.J. et al., Proc. Natl. Acad. Sci.
USA 90:10922-10926 (1993) and DeWitt, S.H. et al., Proc. Natl. Acad. Sci. USA
90:6909-6913 (1993), relating to tagged compounds; see also Rutter, W.J. et al., U.S.
Patent No. 5,010,175; Huebner, V.D. et al., U.S. Patent No. 5,182,366; and Geysen,
H.M., U.S. Patent No. 4,833,092). Random sequence RNA libraries (see Ellington,
A.D. et al., Nature 346:818-822 (1990); Bock, L.C. et al., Nature 355:564-566 (1992);
and Szostak, J.W., Trends in Biochem. Sci. 17:89-93 (March, 1992)) can also be
screened according to the present method to select RNA molecules which bind to a
target FATP or FATP fusion protein. Where binding agents selected from a
combinatorial library by the present method carry unique tags, identification of
individual biomolecules by chromatographic methods is possible. Where binding
agents do not carry tags, chromatographic separation, followed by mass spectrometry to
ascertain structure, can be used to identify binding agents selected by the method, for
example.

The invention also comprises a method for identifying an agent which inhibits 
interaction between a fatty acid transport protein (e.g., one comprising the amino acid 
sequence in SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:51, SEQ ID NO:53, SEQ ID 
2 5 NO: 102, or SEQ ID NO:57), and a ligand of said protein. The FATP can be one 
described by amino acid sequence herein, a portion or fragment thereof, a variant 
thereof, or an ortholog thereof, or a FATP fusion protein. Here, a ligand can be, for 
instance, a substrate, or a substrate mimic, an antibody, or a compound, such as a 
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peptide, that binds with specificity to a site on the protein. The method comprises 
combining, not limited to a particular order, the fatty acid protein, the ligand of the 
protein, and a candidate agent to be assessed for its ability to inhibit interaction between 
the protein and the ligand, under conditions appropriate for interaction between the 
5 protein and the ligand (e.g., pH, salt, temperature conditions conducive to appropriate 
conformation and molecular interactions); determining the extent to which the protein 
and ligand interact; and comparing (1) the extent of protein-ligand interaction in the 
presence of candidate agent with (2) the extent of protein-ligand interaction in the 
absence of candidate agent, wherein if (1) is less than (2), then the candidate agent is 
one which inhibits interaction between the protein and the ligand. 
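
By way of illustration only, the comparison of (1) and (2) can be sketched as a short calculation. The function names and the use of a single numeric signal (e.g., counts from a labeled ligand) are assumptions made for this sketch and are not part of the method as described.

```python
def percent_inhibition(signal_with_agent: float, signal_without_agent: float) -> float:
    """Percent reduction in protein-ligand interaction attributable to the agent.

    signal_with_agent    -- extent of interaction (1), candidate agent present
    signal_without_agent -- extent of interaction (2), candidate agent absent
    Both are hypothetical readouts in the same arbitrary units.
    """
    if signal_without_agent <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * (1.0 - signal_with_agent / signal_without_agent)


def inhibits_interaction(signal_with_agent: float, signal_without_agent: float) -> bool:
    """Criterion stated in the text: the agent inhibits if (1) is less than (2)."""
    return signal_with_agent < signal_without_agent
```

A negative value returned by `percent_inhibition` would indicate more interaction in the presence of the agent than in its absence.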

The method can be facilitated, for example, by using an experimental system 
which employs a solid support (column chromatography matrix, wall of a plate, 
microtiter wells, controlled pore glass, pins to be submerged in a solution, beads, etc.) to 
which the protein can be attached. Accordingly, in one embodiment, the protein can be 
fixed to a solid phase directly or indirectly, by a linker. The candidate agent to be tested 
is added under conditions conducive for interaction and binding to the protein. The 
ligand is added to the solid phase system under conditions appropriate for binding. 
Excess ligand is removed, as by a series of washes done under conditions that do not 
disrupt protein-ligand interactions. Detection of bound ligand can be facilitated by 
using a ligand that carries a label (e.g., fluorescent, chemiluminescent, radioactive). In a 
control experiment, protein and ligand are allowed to interact in the absence of any 
candidate agent, under conditions otherwise identical to those used for the "test" 
conditions where candidate inhibiting agent is present, and any washes used in the test 
conditions are also used in the control. The extent to which ligand binds to the protein 
in the presence of candidate agent is compared to the extent to which ligand binds to the 
protein in the absence of the candidate agent. If the extent to which interaction of the 
protein and the ligand occurs is less in the presence of the candidate agent than in the 
absence of the candidate agent, the candidate agent is an agent which inhibits interaction 
between the protein and the ligand of the protein. 

In a further embodiment, an inhibitor (or an enhancer) of a fatty acid transport 
protein can be identified. The method comprises steps which are, or are variations of, 
the following: contacting the cells with fatty acid, wherein the fatty acid can be labeled 
for convenience of detection; contacting a first aliquot of the cells with an agent being 
tested as an inhibitor (or enhancer) of fatty acid uptake while maintaining a second 
aliquot of cells under the same conditions but without contact with the agent; and 
measuring (e.g., quantitating) fatty acid in the first and second aliquots of cells; wherein 
a lesser quantity of fatty acid in the first aliquot compared to that in the second aliquot 
is indicative that the agent is an inhibitor of fatty acid uptake by a fatty acid transport 
protein. A greater quantity of fatty acid in the first aliquot compared to that in the 
second aliquot is indicative that the agent is an enhancer of fatty acid uptake by a fatty 
acid transport protein. 
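
The decision rule applied to the two aliquots can be sketched as follows. This is a minimal illustration: the replicate readings, the function name and the `tolerance` parameter (a margin for treating small differences as no effect) are assumptions introduced here, not features recited in the method.

```python
from statistics import mean


def classify_agent(first_aliquot, second_aliquot, tolerance=0.0):
    """Classify an agent from fatty acid measured in two aliquots of cells.

    first_aliquot  -- readings from cells contacted with the agent
    second_aliquot -- readings from cells kept under the same conditions
                      without the agent
    tolerance      -- hypothetical fractional margin (0.1 means 10 percent)
    """
    test_mean = mean(first_aliquot)
    control_mean = mean(second_aliquot)
    if test_mean < control_mean * (1.0 - tolerance):
        return "inhibitor"      # lesser quantity in the first aliquot
    if test_mean > control_mean * (1.0 + tolerance):
        return "enhancer"       # greater quantity in the first aliquot
    return "no effect detected"
```

With `tolerance=0.0` the function reduces to the plain lesser-or-greater comparison stated above.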

A particular embodiment of identifying an inhibitor or enhancer of fatty acid 
transport function employs the above steps, but also employs additional steps preceding 
those given above: introducing into cells of a cell strain or cell line ("host cells" for the 
intended introduction of, or after the introduction of, a vector) a vector comprising a 
fatty acid transport protein gene, wherein expression of the gene can be regulatable or 
constitutive, and providing conditions to the host cells under which expression of the 
gene can occur. 

"Contacting" and "combining", as used herein in the context of 
bringing molecules into close proximity to each other, can be accomplished by 
conventional means. For example, when referring to molecules that are soluble, 
contacting is achieved by adding the molecules together in a solution. "Contacting" can 
also be adding an agent to a test system, such as a vessel containing cells in tissue 
culture. 

The term "inhibitor" or "antagonist", as used herein, refers to an agent which 
blocks, diminishes, inhibits, hinders, limits, decreases, reduces, restricts or interferes 
with fatty acid transport into the cytoplasm of a cell, or alternatively or additionally, 
prevents or impedes the cellular effects associated with fatty acid transport. The term 
"enhancer" or "agonist", as used herein, refers to an agent which augments, enhances, or 
increases fatty acid transport into the cytoplasm of a cell. An antagonist will decrease 
fatty acid concentration, fatty acid metabolism and byproduct levels in the cell, leading 
to phenotypic and molecular changes. 

In order to produce a "host cell" type suitable for fatty acid uptake assays and for 
assays derived therefrom for identifying inhibitors or enhancers thereof, a nucleic acid 
vector can be constructed to comprise a gene encoding a fatty acid transport protein, for 
example, human FATP1, FATP2, FATP3, FATP4, FATP5, FATP6, a mutant or variant 
thereof, an ortholog of the human proteins, such as mouse orthologs or orthologs found 
in other mammals, or a FATP family protein of origin in an organism other than a 
mammal. The gene of the vector can be regulatable, such as by the placement of the 
gene under the control of an inducible or repressible promoter in the vector (e.g., 
inducible or repressible by a change in growth conditions of the host cell harboring the 
vector, such as addition of inducer, binding or functional removal of repressor from the 
cell milieu, or change in temperature) such that expression of the FATP gene can be 
turned on or initiated by causing a change in growth conditions, thereby causing the 
protein encoded by the gene to be produced, in host cells comprising the vector, as a 
plasma membrane protein. Alternatively, the FATP gene can be constitutively 
expressed. 

A vector comprising an FATP gene, such as a vector described herein, can be 
introduced into host cells by a means appropriate to the vector and to the host cell type. 
For example, commonly used methods such as electroporation, transfection, for 
instance, transfection using CaCl2, and transduction (as for a virus or bacteriophage) can 
be used. Host cells can be, for example, mammalian cells such as primary culture cells 
or cells of cell lines such as COS cells, 293 cells or Jurkat cells. Host cells can also be, 
in some cases, cells derived from insects, cells of insect cell lines, bacterial cells, such 
as E. coli, or yeast cells, such as S. cerevisiae. It is preferred that the fatty acid transport 
protein whose function is to be assessed, with or without a candidate inhibitor or 
enhancer, be produced in host cells whose ancestor cells originated in a species related 
to the species of origin of the FATP gene encoding the fatty acid transport protein. For 
example, it is preferable that tests of function or of inhibition or enhancement of a 
mammalian FATP be carried out in host mammalian cells producing the FATP, rather 
than bacterial cells or yeast cells. 

Host cells comprising a vector comprising a regulatable FATP gene can be 
treated so as to allow expression of the FATP gene and production of the encoded 
protein (e.g., by contacting the cells with an inducer compound that effects transcription 
from an inducible promoter operably linked to the FATP gene). 

The test agent (e.g., an agonist or antagonist) is added to the cells to be used in a 
fatty acid transport assay, in the presence or absence of test agent, under conditions 
suitable for production and/or maintenance of the expressed FATP in a conformation 
appropriate for association of the FATP with test agent and substrate. For example, 
conditions under which an agent is assessed, such as media and temperature 
requirements, can, initially, be similar to those necessary for transport of typical fatty 
acid substrates across the plasma membrane. One of ordinary skill in the art will know 
how to vary experimental conditions depending upon the biochemical nature of the test 
agent. The test agent can be added to the cells in the presence of fatty acid, or in the 
absence of fatty acid substrate, with the fatty acid substrate being added following the 
addition of the test agent. The concentration at which the test agent can be evaluated 
can be varied, as appropriate, to test for an increased effect with increasing 
concentrations. 
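
A test for an increased effect with increasing concentrations can be sketched as a simple trend check. The function name and the representation of the effect as one number per concentration (e.g., percent reduction in fatty acid uptake) are assumptions made for illustration only.

```python
def is_dose_dependent(concentrations, effects):
    """Return True if the measured effect never decreases as concentration
    rises and is larger at the highest concentration than at the lowest.

    concentrations -- tested concentrations of the agent, any order
    effects        -- hypothetical effect measured at each concentration
    """
    if len(concentrations) != len(effects) or len(concentrations) < 2:
        raise ValueError("need at least two matched concentration/effect pairs")
    # Order the effects by ascending concentration before comparing neighbors.
    ordered = [effect for _, effect in sorted(zip(concentrations, effects))]
    non_decreasing = all(b >= a for a, b in zip(ordered, ordered[1:]))
    return non_decreasing and ordered[-1] > ordered[0]
```

A check of this kind does not by itself establish significance; replicate measurements would ordinarily be needed for that.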

Test agents to be assessed for their effects on fatty acid transport can be any 
chemical (element, molecule, compound), made synthetically, made by recombinant 
techniques or isolated from a natural source. For example, test agents can be peptides, 
polypeptides, peptoids, sugars, hormones, or nucleic acid molecules, such as antisense 
nucleic acid molecules. In addition, test agents can be small molecules or molecules of 
greater complexity made by combinatorial chemistry, for example, and compiled into 
libraries. These libraries can comprise, for example, alcohols, alkyl halides, amines, 
amides, esters, aldehydes, ethers and other classes of organic compounds. Test agents 
can also be natural or genetically engineered products isolated from lysates of cells, 
bacterial, animal or plant, or can be the cell lysates themselves. Presentation of test 
compounds to the test system can be in either an isolated form or as mixtures of 
compounds, especially in initial screening steps. 

Thus, the invention relates to a method for identifying agents which alter fatty 
acid transport, the method comprising providing the test agent to the cell (wherein "cell" 
includes the plural, and can include cells of a cell strain, cell line or culture of primary 
cells or organ culture, for example), under conditions suitable for binding to its target, 
whether to the FATP itself or to another target on or in the cell, wherein the transformed 
cell comprises a FATP. 

In greater detail, to test one or more agents or compounds (e.g., a mixture of 
compounds can conveniently be screened initially) for inhibition of the transport 
function of a fatty acid transport protein, the agent(s) can be contacted with the cells. 
The cells can be contacted with a labeled fatty acid. The fatty acid can be, for example, 
a known substrate of the fatty acid transport protein such as oleate or palmitate. The 
fatty acid can itself be labeled with a radioactive isotope (e.g., 3H or 14C) or can have a 
radioactively labeled adduct attached. In other variations, the fatty acid can have 
chemically attached to it a fluorescent label, or a substrate for an enzyme occurring 
within the cells, wherein the substrate yields a detectable product, such as a highly 
colored or fluorescent product. Addition of candidate inhibitors and labeled substrate to 
the cells comprising fatty acid transport protein can be in either order or can be 
simultaneous. 

A second aliquot of cells, which can be called "control" cells (a "first" aliquot of 
cells can be called "test" cells), is treated, if necessary (as in the case of transformed 
"host" cells), so as to allow expression of the FATP gene, and is contacted with the 
labeled substrate of the fatty acid transport protein. The second aliquot of cells is not 
contacted with one or more agents to be tested for inhibition of the transport function of 
the protein produced in the cells, but is otherwise kept under the same culture conditions 
as the first aliquot of cells. 

In a further step of a method to identify inhibitors of a fatty acid transport 
protein, the labeled fatty acid is measured in the first and second aliquots of cells. A 
preliminary step of this measurement process can be to separate the external medium 
from the cells so as to be able to distinguish the labeled fatty acid external to the cells 
from that which has been transported inside the cells. This can be accomplished, for 
instance, by removing the cells from their growth container, centrifuging the cell 
suspension, removing the supernatant and performing one or more wash steps to 
extensively dilute the remaining medium which may contain labeled fatty acid. 
Detection of the labeled fatty acid can be by a means appropriate to the label used. For 
example, for a radioactive label, detection can be by scintillation counting of 
appropriately prepared samples of cells (e.g., lysates or protein extracts); for a 
fluorescent label, by measuring fluorescence in the cells by appropriate instrumentation. 

If a compound tested as a candidate inhibitor of transport function causes the test 
cells to have less labeled fatty acid detected in the cells than that detected in the control 
cells, then the compound is an inhibitor of the fatty acid transport protein. Procedures 
analogous to those above can be devised for identifying enhancers (agonists of FATPs) 
of fatty acid transport function wherein if the test cells contain more labeled fatty acid 
than that detected in the control cells, or if the fatty acid is taken up at a higher rate, then 
the compound being tested can be concluded to be an enhancer of the fatty acid 
transport protein. 

Example 13 describes use of an assay of this type to identify an inhibitor of a 
FATP. In Example 13, an antisense oligonucleotide which specifically inhibits 
biosynthesis of mmFATP4 was demonstrated to inhibit fatty acid uptake into mouse 
enterocytes. Similarly, antisense oligonucleotides directed towards specifically 
inhibiting the biosynthesis of FATP6 in heart cells, FATP5 in liver cells, FATP3 in lung 
cells, and FATP2 in colon cells, can be demonstrated as examples of "test agents" that 
inhibit fatty acid transport. 

Another assay to determine whether an agent is an inhibitor (or enhancer) of 
fatty acid transport employs animals, one or more of which are administered the agent, 
and one or more of which are maintained under similar conditions, but are not 
administered the agent. Both groups of animals are given fatty acids (e.g., orally, 
intravenously, by tube inserted into stomach or intestine), and the fatty acids taken up 
into a bodily fluid (e.g., serum) or into an organ or tissue of interest are measured from 
comparable samples taken from each group of animals. The fatty acids may carry a 
label (e.g., radioactive) to facilitate detection and quantitation of fatty acids taken up 
into the fluid or tissue being sampled. This type of assay can be used alone or can be 
used in addition to in vitro assays of a candidate inhibitor or enhancer. 

An agent determined to be an inhibitor (or enhancer) of FATP function, such as 
fatty acid binding and/or fatty acid uptake, can be administered to cells in culture, or in 
vivo, to a mammal (e.g., a human) to inhibit (or enhance) FATP function. Such an agent 
may be one that acts directly on the FATP (for example, by binding) or can act on an 
intermediate in a biosynthetic pathway to produce FATP, such as transcription of the 
FATP gene, processing of the mRNA, or translation of the mRNA. An example of such 
an agent is an antisense oligonucleotide. 

Antisense methods similar to those illustrated in Example 13 can be used to 
determine the target FATP of a compound or agent that has an inhibitory or enhancing 
effect on fatty acid uptake. For example, antisense oligonucleotide directed to the 
inhibition of FATP4 biosynthesis can be added to lung cells or cell lines derived from 
lung cells. In addition, antisense oligonucleotides directed to the inhibition of other 
FATPs, except for FATP3, can also be added to the lung cells. The administration of 
antisense oligonucleotides in this manner ensures that the predominant FATP activity 
remaining in the cells comes from FATP3. After a period of incubation of the cells with 
the antisense oligonucleotides sufficient to deplete the plasma membrane of the FATPs 
whose biosynthesis has been inhibited, a test agent, preferably one that has been shown 
by some preliminary test to have an inhibitory or enhancing activity on fatty acid 
transport, can be added to the lung cells. If the test agent is now demonstrated, after 
treatment of the cells with antisense oligonucleotides, to have an inhibitory or 
enhancing activity on fatty acid transport in the lung cells, it can be concluded that the 
target of the test agent is FATP3, or a molecule involved in the biosynthesis or activity 
of FATP3. 
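
The reasoning of the antisense approach can be sketched as a small piece of set logic. The list of FATPs expressed in the cells and the function names are hypothetical and serve only to illustrate the inference described above.

```python
def remaining_fatps(expressed, antisense_targets):
    """FATPs expected to remain active after antisense depletion of the others."""
    return sorted(set(expressed) - set(antisense_targets))


def infer_target(expressed, antisense_targets, agent_still_active):
    """Name the candidate target FATP, or return None if no conclusion follows.

    A conclusion is drawn only when the agent still alters fatty acid
    transport and exactly one FATP remains undepleted. The target may be
    that FATP itself or a molecule involved in its biosynthesis or activity.
    """
    remaining = remaining_fatps(expressed, antisense_targets)
    if agent_still_active and len(remaining) == 1:
        return remaining[0]
    return None
```

For instance, with a hypothetical expression profile of FATP1, FATP3 and FATP4 and antisense oligonucleotides directed to FATP1 and FATP4, an agent that remains active would be assigned to FATP3.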

In another type of cell-based assay for uptake of fatty acids, a change of 
intracellular pH resulting from the uptake of fatty acids can be followed by an indicator 
fluorophore. The fluorophore can be taken up by the cells in a preincubation step. Fatty 
acids can be added to the cell medium, and after some period of incubation to allow 
FATP-mediated uptake of fatty acids, the change in λmax of fluorescence can be 
measured, as an indicator of a change in intracellular pH, as the λmax of fluorescence of 
the fluorophore changes with the pH of its environment, thereby indicating uptake of 
fatty acids. One such fluorophore is BCECF (2',7'-bis(2-carboxyethyl)-5(6)-carboxyfluorescein; 
Rink, T.J. et al., J. Cell. Biol. 95:189 (1982)). 

In assays similar to those described above, a candidate inhibitor or enhancer of 
fatty acid transport function can be added (or mock-added, for control cultures) to 
cultures of cells engineered to express a desired FATP to which fatty acid substrate is 
also added. Inhibition of fatty acid uptake is indicated by a lack of the drop in pH, 
indicating fatty acid uptake, that is seen in control cells. Enhancement of fatty acid 
uptake is indicated by a greater decrease in intracellular pH, as compared to control cells not 
receiving the candidate enhancer of fatty acid transport function. 

Yeast cells can be used in a similar cell-based assay for the uptake of fatty acids 
mediated by a FATP, and such an assay can be adapted to a screening assay for the 
identification of agents that inhibit or enhance fatty acid uptake by an FATP. Yeast 
cells lacking an endogenous FATP activity (mutated, disrupted or deleted for FAT1; 
Faergeman, N.J. et al., J. Biol. Chem. 272(13):8531-8538 (1997); Watkins, P.A. et al., 
J. Biol. Chem. 273(29):18210-18219 (1998)) can be engineered to harbor a related gene 
of the family of FATP-encoding genes, such as a mammalian FATP (e.g., human 
FATP4). 

Examples of expression vectors include pEG (Mitchell, D.A., et al., Yeast 9:715-723 
(1993)) and pDAD1 and pDAD2, which contain a GAL1 promoter (Davis, L. I. and 
Fink, G. R., Cell 61:965-978 (1990)). A variety of promoters are suitable for 
expression. Available yeast vectors offer a choice of promoters. In one embodiment, 
the inducible GAL1 promoter is used. In another embodiment, the constitutive ADH1 
promoter (alcohol dehydrogenase; Bennetzen, J. L. and Hall, B. D., J. Biol. Chem. 
257:3026-3031 (1982)) can be used to express an inserted gene on glucose-containing 
media. An example of a vector suitable for expression of a heterologous FATP gene in 
yeast is pQB169. 

With the introduced FATP gene providing the only fatty acid transport protein 
function for the yeast cells, it is possible to study the effect of the heterologous FATP on 
fatty acid transport into the yeast cells in isolation. Assays for the uptake of fatty acids 
into the yeast cells can be devised that are similar to those described above and/or those 
assays that have been illustrated in the Examples. Tests for candidate inhibitors or 
enhancers of the heterologous FATP can be done in cultures of yeast cells, wherein the 
yeast cells are incubated with fatty acid substrate and an agent to be tested as an 
inhibitor or enhancer of FATP function. Fatty acid uptake after a period of time can be 
measured by analyzing the contents of the yeast cells for fatty acid substrate, as 
compared with control yeast cells incubated with the fatty acid, but not with the test 
agent. Yeast cells have the additional advantage, over mammalian cells in culture, for 
example, that yeast cells can be forced to rely upon fatty acids as their only source of 
carbon, if the growth medium supplied to the yeast cells is formulated to contain no 
other source of carbon. Thus, the effect of the heterologous FATP on fatty acid uptake 
and metabolism in the engineered yeast cells can be amplified. An agent that efficiently 
blocks transport function of the heterologous FATP could result in death of the yeast 
cells. Thus, in this case, inhibition of function of the heterologous FATP can result in 
loss of viability. A simple measure of viability is turbidity of the yeast suspension 
culture, which can be adapted to a high throughput screening assay for effects of various 
agents to be tested, using microtiter plates or similar devices for small-volume cultures 
of the engineered yeast cells. 
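
A turbidity readout from a microtiter plate can be reduced to a list of candidate inhibitors along the following lines. The well labels, the dictionary layout and the `cutoff` fraction are assumptions chosen for this sketch; an actual screen would set its own threshold.

```python
from statistics import mean


def flag_growth_inhibitors(plate_od, control_wells, cutoff=0.5):
    """Return wells whose turbidity falls below a fraction of control growth.

    plate_od      -- mapping of well label to optical density of the culture
    control_wells -- wells that received no test agent
    cutoff        -- hypothetical fraction of mean control turbidity below
                     which a well is flagged as showing loss of viability
    """
    control_mean = mean(plate_od[well] for well in control_wells)
    threshold = cutoff * control_mean
    return sorted(
        well
        for well, od in plate_od.items()
        if well not in control_wells and od < threshold
    )
```

Wells flagged in this way would be candidates for agents blocking the heterologous FATP, subject to confirmation that the agent is not simply toxic to the yeast by another route.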

Cell-free assays can also be used to measure the transport of fatty acids across a 
membrane, and therefore also to assess a test treatment or test agent for its effect on the 
rate or extent of fatty acid transport. An isolated FATP, for example in the presence of 
a detergent that preserves the native 3-dimensional structure of the FATP, or partially 
purified FATP, can be used in an artificial membrane system typically used to preserve 
the native conformation and activity of membrane proteins. Such systems include 
liposomes, artificial bilayers of phospholipids, isolated plasma membrane such as cell 
membrane fragments, cell membrane fractions, or cell membrane vesicles, and other 
systems in which the FATP can be properly oriented within the membrane to have 
transport activity. Assays for transport activity can be performed using methods 
analogous to those that can be used in cells engineered to predominantly express one 
FATP whose function is to be measured. A labeled (e.g., radioactively labeled) fatty 
acid substrate can be incubated with one side of a bilayer or in a suspension of 
liposomes constructed to integrate a properly oriented FATP. The accumulation of fatty 
acids with time can be measured, using appropriate means to detect the label (e.g., 
scintillation counting of medium on each side of the bilayer, or of the contents of 
liposomes isolated from the surrounding medium). Assays such as these can be adapted 
to use for the testing of agents which might interact with the FATP to produce an 
inhibitory or an enhancing effect on the rate or extent of fatty acid transport. That is, the 
above-described assay can be done in the presence or absence of the agent to be tested, 
and the results compared. 
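
The comparison of transport rates in the presence and absence of an agent can be sketched with a least-squares slope of accumulated label against time. Treating accumulation as linear over the sampled interval is an assumption of this sketch, as are the function names and units.

```python
def uptake_rate(times, counts):
    """Least-squares slope of accumulated label versus time.

    times  -- sampling times (e.g., minutes)
    counts -- label detected inside the liposomes at each time
    """
    n = len(times)
    if n < 2 or n != len(counts):
        raise ValueError("need at least two matched time/count pairs")
    t_mean = sum(times) / n
    c_mean = sum(counts) / n
    sxx = sum((t - t_mean) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("sampling times must not all be identical")
    sxy = sum((t - t_mean) * (c - c_mean) for t, c in zip(times, counts))
    return sxy / sxx


def relative_rate(times, counts_with_agent, counts_without_agent):
    """Ratio of uptake rates; below 1 suggests inhibition, above 1 enhancement."""
    return uptake_rate(times, counts_with_agent) / uptake_rate(times, counts_without_agent)
```

The same calculation could be applied to the extent of transport by comparing the final time points rather than the slopes.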

For examples of isolation of membrane proteins (ADP/ATP carrier and 
uncoupling protein), reconstitution into phospholipid vesicles, and assays of transport, 
see Klingenberg, M. et al., Methods Enzymol. 260:369-389 (1995). For an example of a 
membrane protein (phosphate carrier of Saccharomyces cerevisiae) that was purified 
and solubilized from E. coli inclusion bodies, see Schroer, A. et al., J. Biol. Chem. 
273:14269-14276 (1998). The Glut1 glucose transporter of rat has been expressed in yeast. 
A crude membrane fraction of the yeast was prepared and reconstituted with soybean 
phospholipids into liposomes. Glucose transport activity could be measured in the 
liposomes (Kasahara, T. and Kasahara, M., J. Biol. Chem. 273:29113-29117 (1998)). 
Similar methods can be applied to the proteins and polypeptides of the invention. 

Another embodiment of the invention is a method for inhibiting fatty acid 
uptake in a mammal (e.g., a human), comprising administering to the mammal a 
therapeutically effective amount of an inhibitor of the transport function of one or more 
of the fatty acid transport proteins, thereby decreasing fatty acid uptake by cells 
comprising the fatty acid transport protein(s). Where it is desirable to reduce the uptake of 
fatty acids, for example, in the treatment of chronic obesity or as a part of a program of 
weight control or hyperlipidemia control in a human, one or more inhibitors of one or 
more of the fatty acid transport proteins can be administered in an effective dose, and by 
an effective route, for example, orally, or by an indwelling device that can deliver doses 
to the small intestine. The inhibitor can be one identified by methods described herein, 
or can be one that is, for instance, structurally related to an inhibitor identified by 
methods described herein (e.g., having chemical adducts to better stabilize or solubilize 
the inhibitor). The invention further relates to compositions comprising inhibitors of 
fatty acid uptake in a mammal, which may further comprise pharmaceutical carriers 
suitable for administration to a subject mammal, such as sterile solubilizing or 
emulsifying agents. 

A further embodiment of the present invention is a method of enhancing or 
increasing fatty acid uptake, such as enhancing or increasing LCFA uptake in the small 
intestine (e.g., to treat or prevent a malabsorption syndrome or other wasting condition) 
or in the liver (e.g., by an enhancer of FATP5 transport activity to treat acute liver 
failure) or in the kidney (e.g., by an enhancer of FATP2 transport activity to treat kidney 
failure). In this embodiment, a therapeutically effective amount of an enhancer of the 
transport function of one or more of the fatty acid transport proteins can be administered 
to a mammalian subject, with the result that fatty acid uptake in the small intestine is 
enhanced. In this embodiment, one or more enhancers of one or more of fatty acid 
transport proteins is administered in an effective dose and by a route (e.g., orally or by a 
device, such as an indwelling catheter or other device) which can deliver doses to the 
gut. The enhancer of FATP function (e.g., an enhancer of FATP4 function) can be 
identified by methods described herein or can be one that is structurally similar to an 
enhancer identified by methods described herein. 

Aerobic reperfusion of ischemic myocardium is a common clinical event which 
can occur during such treatments as cardiac surgery, angioplasty, and thrombolytic 
therapy after a myocardial infarction. During reperfusion, a rapid recovery of 
myocardial energy production is essential for the complete recovery of contractile 
function. Not only the extent of recovery of myocardial energy metabolism but also the 
type of energy substrate used by the heart during reperfusion are important determinants 
of functional recovery. Circulating fatty acid levels increase following acute 
myocardial infarction or during cardiac surgery, such that during and following 
ischemia the heart muscle can be exposed to very high concentrations of fatty acids 
(Lopaschuk, G.D. and W. C. Stanley, Science and Medicine (November/December 
1997)). High plasma fatty acid concentrations increase the severity of ischemic damage 
in a number of experimental models of cardiac ischemia and have been linked to 
depression of mechanical function during aerobic reperfusion of previously ischemic 
hearts. Further data show that modifying fatty acid utilization can be beneficial for 
heart function in ischemia and can be a useful approach for the treatment of angina. 
See, e.g., Desideri and Celegon, Am. J. Cardiol. 82(5A):50K-53K; Lopaschuk, Am. J. 
Cardiol. 82(5A):14K-17K. Plasma fatty acid concentrations can be reduced by 
administering to a human subject or other mammal an effective amount of an inhibitor 
of a FATP such as FATP2 or FATP4, thereby providing a way of reducing fatty acid 
utilization by the heart. 

In a further embodiment of the invention, a therapeutically effective amount of 
an inhibitor of hsFATP6 can be administered to a human patient by a suitable route, to 
reduce the uptake of fatty acids by cardiac muscle. This treatment is desirable in 
patients who are diagnosed as having, or who are at risk of, abnormal accumulations of 
fatty acids in the heart or a detrimentally high rate of uptake of fatty acids into the heart, 
because of ischemic heart disease, or following ischemia or trauma to the heart. 

The invention further relates to antibodies that bind to an isolated or 
recombinant fatty acid transport protein of the FATP family, including portions of 
antibodies, which can specifically recognize and bind to one or more FATPs. The 
antibodies and portions thereof of the invention include those which bind to one or more 
FATPs of mouse or other mammalian species. In a preferred embodiment, the 
antibodies specifically bind to a naturally occurring FATP of humans. The antibodies 
can be used in methods to detect or to purify a protein of the present invention or a 
portion thereof by various methods of immunoaffinity chromatography, to inhibit the 
function of a protein in a method of therapy, or to selectively inactivate an active site, or 
to study other aspects of the structure of these proteins, for example. 

The antibodies of the present invention can be polyclonal or monoclonal. The 
term antibody is intended to encompass both polyclonal and monoclonal antibodies. 
Antibodies of the present invention can be raised against an appropriate immunogen, 
including proteins or polypeptides of the present invention, such as an isolated or 
recombinant FATP1, FATP2, FATP3, FATP4, FATP5, FATP6, mtFATP, ceFATPa, 
ceFATPb, scFATP or portions thereof, or synthetic molecules, such as synthetic 
peptides (e.g., conjugated to a suitable carrier). Preferred embodiments are antibodies 
that bind to any of the following: hsFATP1, hsFATP2, hsFATP3, hsFATP4, hsFATP5 
or hsFATP6. The immunogen can be a polypeptide comprising a portion of a FATP 
and having at least one function of a fatty acid transport protein, as described herein. 

The term antibody is also intended to encompass single chain antibodies, 
chimeric, humanized or primatized (CDR-grafted) antibodies and the like, as well as 
chimeric or CDR-grafted single chain antibodies, comprising portions from more than 
one species. For example, the chimeric antibodies can comprise portions of proteins 
derived from two different species, joined together chemically by conventional 
techniques or prepared as a single contiguous protein using genetic engineering 
techniques (e.g., DNA encoding the protein portions of the chimeric antibody can be 
expressed to produce a contiguous protein chain. See, e.g., Cabilly et al., U.S. Patent 
No. 4,816,567; Cabilly et al., European Patent No. 0,125,023 B1; Boss et al., U.S. 
Patent No. 4,816,397; Boss et al., European Patent No. 0,120,694 B1; Neuberger, M.S. 
et al., WO 86/01533; Neuberger, M.S. et al., European Patent No. 0,194,276 B1; 
Winter, U.S. Patent No. 5,225,539; Winter, European Patent No. 0,239,400 B1; Queen 
et al., U.S. Patent No. 5,585,089; and Queen et al., European Patent No. EP 0 451 216 
B1. See also, Newman, R. et al., BioTechnology, 10:1455-1460 (1992), regarding 
primatized antibody, and Ladner et al., U.S. Patent No. 4,946,778 and Bird, R.E. et al., 
Science, 242:423-426 (1988) regarding single chain antibodies.) 

Whole antibodies and biologically functional fragments thereof are also 
encompassed by the term antibody. Biologically functional antibody fragments which 
can be used include those fragments sufficient for binding of the antibody fragment to a 
FATP to occur, such as Fv, Fab, Fab' and F(ab')2 fragments. Such fragments can be 
produced by enzymatic cleavage or by recombinant techniques. For instance, papain or 
pepsin cleavage can generate Fab or F(ab') 2 fragments, respectively. Antibodies can 
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also be produced in a variety of truncated forms using antibody genes in which one or 
more stop codons have been introduced upstream of the natural stop site. For example, 
a chimeric gene encoding a F(ab') 2 heavy chain portion can be designed to include DNA 
sequences encoding the CH1 domain and hinge region of the heavy chain. 

Preparation of immunizing antigen (whole cells comprising FATP on the cell 
surface or purified FATP), and polyclonal and monoclonal antibody production can be 
performed using any suitable technique. A variety of methods have been described (see, 
e.g., Kohler et al., Nature, 256: 495-497 (1975) and Eur. J. Immunol. 6: 511-519 
(1976); Milstein et al., Nature 266: 550-552 (1977); Koprowski et al., U.S. Patent No. 
4,172,124; Harlow, E. and D. Lane, 1988, Antibodies: A Laboratory Manual (Cold 
Spring Harbor Laboratory: Cold Spring Harbor, NY); Chapter 11 in Current Protocols 
in Molecular Biology, Vol. 2 (containing supplements up through Supplement 42, 
1998), Ausubel, F.M. et al., eds. (John Wiley & Sons: New York, NY)). Generally, a 
hybridoma can be produced by fusing a suitable immortal cell line (e.g., a myeloma cell 
line such as SP2/0) with antibody producing cells. The antibody producing cells, 
preferably those obtained from the spleen or lymph nodes, can be obtained from animals 
immunized with the antigen of interest. Immunization of animals can be by 
introduction of whole cells comprising fatty acid transport protein on the cell surface. 
The fused cells (hybridomas) can be isolated using selective culture conditions, and 
cloned by limiting dilution. Cells which produce antibodies with the desired specificity 
can be selected by a suitable assay (e.g., ELISA). 

Other suitable methods of producing or isolating antibodies (including human 
antibodies) of the requisite specificity can be used, including, for example, methods which 
select recombinant antibody from a library (e.g., Hoogenboom et al., WO 93/06213; 
Hoogenboom et al., U.S. Patent No. 5,565,332; WO 94/13804, published June 23, 
1994; and Dower, W.J. et al., U.S. Patent No. 5,427,908), or which rely upon 
immunization of transgenic animals (e.g., mice) capable of producing a full repertoire of 
human antibodies (see, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. USA, 90: 2551-2555 
(1993); Jakobovits et al., Nature, 362:255-258 (1993); Lonberg et al., U.S. Patent No. 
5,569,825; Lonberg et al., U.S. Patent No. 5,545,806; Surani et al., U.S. Patent No. 
5,545,807; and Kucherlapati, R. et al., European Patent No. EP 0 463 151 B1). 

Another aspect of the invention is a method for directing an agent to cardiac 
muscle. The differential expression of FATP6 in cardiac muscle but not in other tissue 
types allows for the specific targeting of drugs, diagnostic agents, tagging labels, 
histological stains or other substances specifically to cardiac muscle. A targeting 
vehicle can be used for the delivery of such a substance. Targeting vehicles which bind 
specifically to FATP6 can be linked to a substance to be delivered to the cells of cardiac 
muscle. The linkage can be, for instance, via one or more covalent bonds, or by high 
affinity non-covalent bonds. A targeting vehicle can be an antibody, for instance, or 
other compound (e.g., a fatty acid or fatty acid analog) which binds to FATP6 with high 
specificity. 

Targeting vehicles specific to the heart-specific protein FATP6 have in vivo 
(e.g., therapeutic and diagnostic) applications. For example, an antibody which 
specifically binds to FATP6 can be conjugated to a drug to be targeted to the heart (e.g., 
a cardiac glycoside to treat congestive heart failure, or β-adrenergic agents, sodium 
channel blockers or calcium channel blockers to treat arrhythmias). A substance (e.g., a 
radioactive substance) which can be detected (e.g., a label) in vivo can also be linked to 
a targeting vehicle which specifically binds to a heart-specific protein such as FATP6, 
and the conjugate can be used as a labeling agent to identify cardiac muscle cells. 

Targeting vehicles specific to FATP6 find further applications in vitro. For 
example, an FATP6-specific targeting vehicle, such as an antibody (a polyclonal 
preparation or monoclonal) which specifically binds to FATP6, can be linked to a 
substance which can be used as a stain for a tissue sample (e.g., horseradish peroxidase) 
to provide a method for the identification of cardiac muscle in a sample, as can be used 
in embryology studies, for example. 

In a similar manner, an agent can be directed to the liver of a mammal, as 
FATP5 is expressed in liver but not in other tissue types. A targeting vehicle which 
specifically binds to FATP5 can be conjugated to a drug for delivery of the drug to the 
liver, such as a drug to treat hepatitis, Wilson's disease, lipid storage diseases and liver 
cancer. As with targeting vehicles specific to FATP6, targeting vehicles specific to 
FATP5 can be used in studying tissue samples in vitro. 

The invention also relates to compositions comprising a modulator of FATP 
function. The term "modulate" as used herein refers to the ability of a molecule to alter 
the function of another molecule. Thus, modulate could mean, for example, inhibit, 
antagonize, agonize, upregulate, downregulate, induce, or suppress. A modulator has 
the capability of altering function of its target. Such alteration can be accomplished at 
any stage of the transcription, translation, expression or function of the protein, so that, 
for example, modulation of a target gene can be accomplished by modulation of the 
DNA or RNA encoding the protein, and the protein itself. 

Antagonists or agonists (inhibitors or enhancers) of the FATPs of the invention, 
antibodies that bind a FATP, or mimetics of a FATP can be employed in combination 
with a non-sterile or sterile carrier or carriers for use with cells, tissues or organisms, 
such as a pharmaceutical carrier suitable for administration to a mammalian subject. 
Such compositions comprise, for instance, a media additive or a therapeutically 
effective amount of an inhibitor or enhancer compound to be identified by an assay of 
the invention and a pharmaceutically acceptable carrier or excipient. Such carriers may 
include, but are not limited to, saline, buffered saline, dextrose, water, ethanol, 
surfactants, such as glycerol, excipients such as lactose and combinations thereof. The 
formulation can be chosen by one of ordinary skill in the art to suit the mode of 
administration. The chosen route of administration will be influenced by the 
predominant tissue or organ location of the FATP whose function is to be inhibited or 
enhanced. For example, for affecting the function of FATP4, a preferred administration 
can be oral or through a tube inserted into the stomach (e.g., direct stomach tube or 
nasopharyngeal tube), or through other means to accomplish delivery to the small 
intestine. The invention further relates to diagnostic and pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the ingredients of the 
aforementioned compositions of the invention. 

Compounds of the invention which are FATPs, FATP fusion proteins, FATP 
mimetics, FATP gene-specific antisense poly- or oligonucleotides, inhibitors or 
enhancers of a FATP may be employed alone or in conjunction with other compounds, 
such as therapeutic compounds. The pharmaceutical compositions may be administered 
in any effective, convenient manner, including administration by topical, oral, anal, 
vaginal, intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, 
transdermal or intradermal routes, among others. In therapy or as a prophylactic, the 
active agent may be administered to an individual as an injectable composition, for 
example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively, the composition may be formulated for topical application, for 
example, in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain 
appropriate conventional additives, including, for example, preservatives, solvents to 
assist drug penetration, and emollients in ointments and creams. Such topical 
formulations may also contain compatible conventional carriers, for example cream or 
ointment bases, and ethanol or oleyl alcohol for lotions. 

In addition, the amount of the compound will vary depending on the size, age, 
body weight, general health, sex, and diet of the host, and the time of administration, the 
biological half-life of the compound, and the particular characteristics and symptoms of 
the disorder to be treated. Adjustment and manipulation of established dose ranges are 
well within the ability of those of skill in the art. 

A further aspect of the invention is a method to identify a polymorphism, or the 
presence of an alternative or variant allele of a gene in the genome of an organism (of 
interest here, genes encoding FATPs). As used herein, polymorphism refers to the 
occurrence of two or more genetically determined alternative sequences or alleles in a 
population. A polymorphic locus may be as small as a base pair. Polymorphic markers 
include restriction fragment length polymorphisms, variable number of tandem repeats 
(VNTRs), hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide 
repeats, tetranucleotide repeats, simple sequence repeats, and insertion elements such as 
Alu. The first identified allelic form, or the most frequently occurring form, can be 
arbitrarily designated as the reference (usually, "wildtype") form, and other allelic forms 
are designated as alternative (sometimes, "mutant" or "variant"). Diploid organisms 
may be homozygous or heterozygous for allelic forms. 

An "allele" or "allelic sequence" is an alternative form of a gene which may 
result from at least one mutation in the nucleotide sequence. Alleles may result in 
altered mRNAs or polypeptides whose structure or function may or may not be altered. 
Any given gene may have none, one, or many allelic forms (polymorphism). Common 
mutational changes which give rise to alleles are generally ascribed to natural deletions, 
additions, or substitutions of nucleotides. Each of these types of changes may occur 
alone, or in combination with the others, one or more times in a given sequence. 

Several different types of polymorphisms have been reported. A restriction 
fragment length polymorphism (RFLP) is a variation in DNA sequence that alters the 
length of a restriction fragment (Botstein et al, Am. J. Hum. Genet. 32:314-331 (1980)). 
The restriction fragment length polymorphism may create or delete a restriction site, 
thus changing the length of the restriction fragment. RFLPs have been widely used in 
human and animal genetic analyses (see WO 90/13668; WO 90/11369; Donis-Keller, 
Cell 51:319-337 (1987); Lander et al., Genetics 121:85-99 (1989)). When a heritable 
trait can be linked to a particular RFLP, the presence of the RFLP in an individual can 
be used to predict the likelihood that the individual will also exhibit the trait. 
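
As a hedged illustration of the RFLP principle described above, the following Python sketch digests a sequence in silico and shows how a single base change that destroys a restriction site changes the fragment lengths. The two short sequences, the function name `digest`, and the choice of an EcoRI-like site (GAATTC, cut after the first G) are illustrative assumptions; they are not FATP sequences of the invention.

```python
def digest(sequence, site, cut_offset):
    """Return fragment lengths after cutting `sequence` at each occurrence of `site`.

    `cut_offset` is the number of bases into the site at which the cut is made.
    Only one strand is modelled; this is a simplification for illustration.
    """
    seq = sequence.upper()
    cuts = []
    start = 0
    while True:
        idx = seq.find(site, start)
        if idx == -1:
            break
        cuts.append(idx + cut_offset)
        start = idx + 1
    bounds = [0] + cuts + [len(seq)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


# Hypothetical reference allele with two GAATTC sites.
reference = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAATTC" + "TTTT"
# Hypothetical variant allele: one substitution (A to G) removes the second site.
variant = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAGTTC" + "TTTT"

reference_fragments = digest(reference, "GAATTC", 1)
variant_fragments = digest(variant, "GAATTC", 1)
print(reference_fragments, variant_fragments)
```

In this toy case the loss of one site merges two fragments into a single longer one, which is the kind of length difference an RFLP analysis detects.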

Other polymorphisms take the form of short tandem repeats (STRs) that include 
tandem di-, tri- and tetra-nucleotide repeated motifs. These tandem repeats are also 
referred to as variable number tandem repeat (VNTR) polymorphisms. VNTRs have 
been used in identity and paternity analysis (US 5,075,217; Armour et al., FEBS Lett. 
307:113-115 (1992); Horn et al., WO 91/14003; Jeffreys, EP 370,719), and in a large 
number of genetic mapping studies. 

Other polymorphisms take the form of single nucleotide variations between 
individuals of the same species. Such polymorphisms are far more frequent than 
RFLPs, STRs (short tandem repeats) and VNTRs (variable number tandem repeats). 
Some single nucleotide polymorphisms occur in protein-coding sequences, in which 
case, one of the polymorphic forms may give rise to the expression of a defective or 
other variant protein and, potentially, a genetic disease. Other single nucleotide 
polymorphisms occur in noncoding regions. Some of these polymorphisms may also 
result in defective protein expression (e.g., as a result of defective splicing). Other 
single nucleotide polymorphisms have no phenotypic effects. 

Many of the methods described below require amplification of DNA from target 
samples and purification of the amplified products. This can be accomplished by PCR, 
for instance. See generally, PCR Technology, Principles and Applications for DNA 
Amplification (ed. H.A. Erlich), Freeman Press, New York, NY, 1992; PCR Protocols: 
A Guide to Methods and Applications (eds. Innis, et al.), Academic Press, San Diego, 
CA, 1990; Mattila et al., Nucleic Acids Res. 19:4967 (1991); Eckert et al., PCR 
Methods and Applications 1:17 (1991); PCR (eds. McPherson et al., IRL Press, 
Oxford); and US 4,683,202. 

Other suitable amplification methods include the ligase chain reaction (LCR) 
(see Wu and Wallace, Genomics 4:560 (1989); Landegren et al., Science 241:1077 
(1988)), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173 
(1989)), self-sustained sequence replication (Guatelli et al., Proc. Natl. Acad. Sci. USA 
87:1874 (1990)), and nucleic acid based sequence amplification (NASBA). The latter 
two amplification methods involve isothermal reactions based on isothermal 
transcription, which produce both single stranded RNA (ssRNA) and double stranded 
DNA (dsDNA) as the amplification products in a ratio of about 30 or 100 to 1, 
respectively. 

Another aspect of the invention is a method for detecting a variant allele of a 
human FATP gene, comprising preparing amplified, purified FATP DNA from a 
reference human and amplified, purified FATP DNA from a "test" human to be 
compared to the reference as having a variant allele, using the same or comparable 
amplification procedures, and determining whether the reference DNA and test DNA 
differ in DNA sequence in the FATP gene, whether in a coding or a noncoding region, 
wherein, if the test DNA differs in sequence from the reference DNA, the test DNA 
comprises a variant allele of a human FATP gene. The following is a discussion of 
some of the methods by which it can be determined whether the reference FATP DNA 
and test FATP DNA differ in sequence. 
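
Once reference and test sequences are in hand (for example from direct sequencing), the comparison step of the method can be sketched as follows. This is a minimal Python illustration that assumes the two sequences are already aligned and of equal length; the function names and the short sequences are hypothetical and are not FATP sequences.

```python
def find_differences(reference, test):
    """Return (position, reference_base, test_base) for every mismatch.

    Positions are 1-based. The sketch assumes pre-aligned, equal-length
    sequences and does not handle insertions or deletions.
    """
    if len(reference) != len(test):
        raise ValueError("sequences must be pre-aligned and of equal length")
    ref, tst = reference.upper(), test.upper()
    return [(i + 1, r, t) for i, (r, t) in enumerate(zip(ref, tst)) if r != t]


def has_variant_allele(reference, test):
    """True if the test sequence differs from the reference at any position."""
    return bool(find_differences(reference, test))


# Hypothetical reference and test sequences differing at one base.
reference_dna = "ATGGCCTTA"
test_dna = "ATGGCTTTA"
print(find_differences(reference_dna, test_dna))
```

A real comparison would first align the reads, since insertions and deletions shift all downstream positions.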

Direct Sequencing. The direct analysis of the sequence of variant alleles of the 
present invention can be accomplished using either the dideoxy chain termination 
method or the Maxam and Gilbert method (see Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor Press, New York 1989; Zyskind et al., 
Recombinant DNA Laboratory Manual, Acad. Press, 1988). 

Denaturing Gradient Gel Electrophoresis. Amplification products generated 
using the polymerase chain reaction can be analyzed by the use of denaturing gradient 
gel electrophoresis. Different alleles can be identified based on the different 
sequence-dependent strand dissociation properties and electrophoretic migration of DNA in 
solution (chapter 7 in Erlich, ed. PCR Technology, Principles and Applications for DNA 
Amplification, W.H. Freeman and Co., New York, 1992). 

Single-strand Conformation Polymorphism Analysis. Alleles of target 
sequences can be differentiated using single-strand conformation polymorphism 
analysis, which identifies base differences by alteration in electrophoretic migration of 
single stranded PCR products, as described in Orita et al., Proc. Natl. Acad. Sci. USA 
86:2766-2770 (1989). Amplified PCR products can be generated as described above, 
and heated or otherwise denatured, to form single-stranded amplification products. 
Single-stranded nucleic acids may refold or form secondary structures which are 
partially dependent on the base sequence. The different electrophoretic mobilities of 
single-stranded amplification products can be related to base-sequence differences 
between alleles of target sequences. 

Detection of Binding by Protein That Binds to Mismatches. Amplified DNA 
comprising the FATP gene or portion of the gene of interest from genomic DNA, for 
example, of a normal individual is prepared, using primers designed on the basis of the 
DNA sequences provided herein. Amplified DNA is also prepared, in a similar manner, 
from genomic DNA of an individual to be tested for bearing a distinguishable allele. 
The primers used in PCR carry different labels, for example, primer 1 with biotin, and 
primer 2 with 32P. Unused primers are separated from the PCR products, and the 
products are quantitated. The heteroduplexes are used in a mismatch detection assay 
using immobilized mismatch binding protein (MutS) bound to nitrocellulose. The 
presence of biotin-labeled DNA, wherein mismatched regions are bound to the 
nitrocellulose via MutS protein, is detected by visualizing the binding of streptavidin to 
biotin. See WO 95/12689. MutS protein has also been used in the detection of point 
mutations in a gel-mobility-shift assay (Lishanski, A. et al., Proc. Natl. Acad. Sci. USA 
91:2674-2678 (1994)). 

Other methods, such as those described below, can be used to distinguish a 
FATP allele from a reference allele, once a particular allele has been characterized as to 
DNA sequence. 

Allele-specific probes. The design and use of allele-specific probes for 
analyzing polymorphisms is described by, e.g., Saiki et al., Nature 324:163-166 (1986); 
Dattagupta, EP 235,726; Saiki, WO 89/11548. Allele-specific probes can be designed 
so that they hybridize to a segment of a target DNA from one individual but do not 
hybridize to the corresponding segment from another individual due to the presence of 
different polymorphic forms in the respective segments from the two individuals. 
Hybridization conditions should be sufficiently stringent that there is a significant 
difference in hybridization intensity between alleles, and preferably an essentially 
binary response, whereby a probe hybridizes to only one of the alleles. Some probes are 
designed to hybridize to a segment of target DNA such that the polymorphic site aligns 
with a central position (e.g., in a 15-mer at the 7 position; in a 16-mer, at either the 8 or 
9 position) of the probe. This design of probe achieves good discrimination in 
hybridization between different allelic forms. 
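
The centered-probe design described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the polymorphic base is placed at 0-based offset (length - 1) // 2 within the probe, which is the exact center of a 15-mer and one of the two central positions of a 16-mer; the target sequence, the site index and the function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement, i.e. the strand that would hybridize."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def allele_specific_probe(target, site_index, allele, length=15):
    """Return the target segment of `length` bases centered on the polymorphic site.

    `site_index` is the 0-based index of the polymorphic base in `target`;
    `allele` is the base the probe should match at that site.
    """
    center = (length - 1) // 2
    start = site_index - center
    end = start + length
    if start < 0 or end > len(target):
        raise ValueError("polymorphic site is too close to the end of the target")
    segment = target[start:site_index] + allele + target[site_index + 1:end]
    return segment.upper()


# Hypothetical target with a polymorphic site at index 12 (reference base A).
target = "ACGTACGTACGTACGTACGTACGTA"
site = 12
reference_probe = allele_specific_probe(target, site, target[site])
variant_probe = allele_specific_probe(target, site, "G")
print(reference_probe, variant_probe, reverse_complement(reference_probe))
```

The two probes differ only at the central base, so a mismatch with the non-matching allele falls where it is most disruptive to duplex stability.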

Allele-specific probes are often used in pairs, one member of a pair showing a 
perfect match to a reference form of a target sequence and the other member showing a 
perfect match to a variant form. Several pairs of probes can then be immobilized on the 
same support for simultaneous analysis of multiple polymorphisms within the same 
target sequence. 

Allele-specific Primers. An allele-specific primer hybridizes to a site on target 
DNA overlapping a polymorphism, and only primes amplification of an allelic form to 
which the primer exhibits perfect complementarity. See Gibbs, Nucleic Acids Res. 
17:2427-2448 (1989). This primer is used in conjunction with a second primer which 
hybridizes at a distal site. Amplification proceeds from the two primers, resulting in a 
detectable product which indicates the particular allelic form is present. A control is 
usually performed with a second pair of primers, one of which shows a single base 
mismatch at the polymorphic site and the other of which exhibits perfect 
complementarity to a distal site. The single-base mismatch prevents amplification and 
no detectable product is formed. The method works best when the mismatch is included 
in the 3'-most position of the oligonucleotide aligned with the polymorphism because 
this position is most destabilizing to elongation from the primer (see, e.g., WO 
93/22456). 
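
The placement of the polymorphic base at the 3'-most position of the primer can be illustrated with the short Python sketch below. It is a deliberately simplified model and not a primer-design tool: the primer is written as the same-strand sequence ending at the polymorphic site, priming is treated as all-or-nothing on a perfect match, and the template, site index and function names are hypothetical.

```python
def allele_specific_primer(template, site_index, allele, length=20):
    """Return a forward primer whose 3'-most base is the chosen allele.

    `site_index` is the 0-based index of the polymorphic base in `template`.
    """
    start = site_index - length + 1
    if start < 0:
        raise ValueError("not enough upstream sequence for a primer of this length")
    return (template[start:site_index] + allele).upper()


def would_prime(primer, template, site_index):
    """Simplified rule: prime only on a perfect match, including the 3' base."""
    start = site_index - len(primer) + 1
    if start < 0:
        return False
    return template[start:site_index + 1].upper() == primer.upper()


# Hypothetical template carrying allele A at index 20.
template = "GATTACAGATTACAGATTACAGGCCT"
site = 20
ref_primer = allele_specific_primer(template, site, "A", length=10)
var_primer = allele_specific_primer(template, site, "G", length=10)
print(ref_primer, would_prime(ref_primer, template, site))
print(var_primer, would_prime(var_primer, template, site))
```

Only the primer matching the allele present in the template satisfies the rule, mirroring the control with a single-base mismatch described in the text.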

Gene Chips. Allelic variants can also be identified by hybridization to nucleic 
acids immobilized on solid supports (gene chips), as described, for example, in WO 
95/11995 and U.S. Patent No. 5,143,854, both of which are incorporated herein by 
reference. WO 95/11995 describes subarrays that are optimized for detection of a 
characterized variant allele. Such a subarray contains probes designed to be 
complementary to a second reference sequence, which is an allelic variant of the first 
reference sequence. 

The present method is illustrated by the following examples, which are not 
intended to be limiting in any way. 

EXAMPLES 
Materials and Methods 

The following Materials and Methods were used in the work described in 
Examples 1-5. 

Sequence Alignment of FATP Clones. The DNA sequence for mouse FATP1 
was obtained from the National Center for Biotechnology Information nonredundant 
database. cDNAs for mmFATP2, 3, 4, and 5 were obtained by screening mouse 
expression libraries (purchased from GIBCO/BRL) with probes derived from the cloned 
expressed sequence tags (ESTs) (Research Genetics, Huntsville, AL). Full-length 
clones were obtained for mmFATP2 and 5 and partial sequences for mmFATP3 and 4. 
The sequences described herein have been deposited in the GenBank database 
(Accession Nos. FATP2, AF072760; FATP3, AF072759; FATP4, AF072758; FATP5, 
AF072757). 

Neither FATP2 nor FATP5 contains an in-frame stop codon upstream of the 
putative initiator methionine; initiator methionines were assigned by homology with 
that in mmFATP1 and by the presence of a signal sequence immediately after it. The 
Mycobacterium tuberculosis, Caenorhabditis elegans, and Saccharomyces cerevisiae 
sequences were present in the dbEST database as part of the sequencing projects for 
these organisms. Sequences were aligned utilizing a ClustalX algorithm and the 
resulting alignment exported to SeqVu. Homologous amino acid substitutions are 
boxed in Figure 1 and were determined using the Dayhoff 250 method with a 50% 
homology cutoff. 

Cell Transfection and LCFA Uptake. COS cells were cotransfected using the 
DEAE-dextran method with the mammalian expression vector pCDNA 3.1 (Invitrogen) 
expressing the gene for CD2 (pCDNA-CD2) in combination with either a pCDNA 3.1 
or pCMVSPORT2 (GIBCO/BRL) expression vector containing one of the murine or 
nematode FATP genes (pCDNA-mmFATP1, pCDNA-FATP2, pCMVSPORT-FATP5, 
pCDNA-ceFATPb). Two days after transfection, cells were assayed for CD2 expression 
with a phycoerythrin-coupled anti-CD2 (PE-CD2) monoclonal antibody (PharMingen), 
and fatty acid uptake was assayed with a BODIPY-labeled fatty acid analogue 
(Molecular Probes). Briefly, cells were washed twice with PBS (phosphate buffered 
saline) and stained with PE-CD2 at 4°C for 30 min in PBS containing 10% fetal calf 
serum. They were then washed three times with PBS/fetal calf serum for 5 min 
followed by an incubation for 2 min at 37°C in fatty acid uptake solution, which 
contained 0.1 µM BODIPY-FA and 0.1% fatty acid-free BSA (bovine serum albumin) 
in PBS (Schaffer, J.E. & Lodish, H.F. (1994) Cell 79:427-436). After 2 min, the cells 
were washed four times with ice-cold PBS/0.1% BSA. The cells were then removed 
from the plates with PBS containing 5 mM EDTA and resuspended in PBS containing 
10% fetal calf serum and 10 mM EDTA. PE-CD2 and BODIPY-FA fluorescence were 
measured using a FACScan (Becton Dickinson). COS cells were gated on forward 
scatter (FSC) and side scatter (SS). Cells exhibiting more than 300 CD2 fluorescence 
units (dsim) representing 15% of all cells were deemed CD2 positive and their 
BODIPY-FA fluorescence was quantitated. 
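
The gating and quantitation step described above can be sketched in Python as follows. The 300-unit CD2 threshold is taken from the text; everything else (the per-cell event tuples, their values, and the function names) is hypothetical and serves only to show the calculation, not to reproduce any measured data.

```python
from statistics import mean

CD2_THRESHOLD = 300.0  # CD2 fluorescence units above which a cell is deemed CD2 positive


def mean_uptake_of_cd2_positive(events, threshold=CD2_THRESHOLD):
    """Mean BODIPY-FA fluorescence of cells whose CD2 signal exceeds the threshold.

    `events` is a list of (cd2_fluorescence, bodipy_fluorescence) pairs, one per cell.
    """
    positive = [bodipy for cd2, bodipy in events if cd2 > threshold]
    if not positive:
        raise ValueError("no CD2-positive cells in this sample")
    return mean(positive)


def fold_over_control(sample_events, control_events, threshold=CD2_THRESHOLD):
    """Ratio of mean uptake in CD2-positive sample cells to CD2-positive control cells."""
    return (mean_uptake_of_cd2_positive(sample_events, threshold)
            / mean_uptake_of_cd2_positive(control_events, threshold))


# Invented example events: (CD2 fluorescence, BODIPY-FA fluorescence).
control_events = [(50.0, 8.0), (420.0, 10.0), (510.0, 12.0), (90.0, 9.0)]
fatp_events = [(40.0, 9.0), (450.0, 300.0), (600.0, 360.0), (120.0, 11.0)]
print(fold_over_control(fatp_events, control_events))
```

Restricting the comparison to CD2-positive cells is what lets the cotransfected marker stand in for cells that also received the FATP vector.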

E. coli-Based LCFA Uptake Assay. The full-length coding region of mtFATP 
and a control protein, the mammalian transcription factor TFE3, were subcloned into the 
inducible, prokaryotic expression vector pET (Novagen). Expression was induced with 
1 mM isopropyl β-D-thiogalactoside (IPTG) for 1 hour, or cells were left uninduced. 
Cells were washed in PBS/0.1% BSA and resuspended in 1 ml PBS/0.1% BSA 
containing 0.1 µM [3H]palmitate (NEN) at 37°C. Uptake was stopped after the 
indicated incubation time by transferring the cells onto filter paper using a cell harvester 
(Brandel, Bethesda, MD). Filters were washed extensively with ice-cold PBS/0.1% 
BSA, and [3H]palmitate was quantitated by scintillation counting. 

Northern Blots. Northern blot analysis of murine FATP expression was done 
using poly(A) mRNA blots (Clontech). Probes of each of the FATPs were derived from 
the 3' untranslated regions of each gene and were <60% identical in sequence. Probes 
were labeled by random priming (Boehringer Mannheim) and hybridized at 65°C. 
Blots were extensively washed in 0.2% SSC/0.1% SDS at 65°C. 

Generation of Phylogenetic Trees. Complete and partial sequences for FATP 
genes from human, rat, mouse, puffer fish, Drosophila melanogaster, C. elegans, S. 
cerevisiae, and M. tuberculosis were aligned using ClustalX. A homologous region of 
48 amino acids (residues 472-519 in mmFATP1) from all of the genes was used to 
determine phylogenetic relationship within ClustalX. Based on these data a 
phylogenetic tree was generated using TreeView PPC (Figure 5). 

Nomenclature. It is proposed that the FATP genes be given a species specific 
prefix (mm, Mus musculus; hs, Homo sapiens; mt, M. tuberculosis; dm, D. 
melanogaster; ce, C. elegans; sc, S. cerevisiae) and numbered such that mammalian 
homologues in different species share the same number but differ in their prefix. Since 
the two C. elegans genes cannot be paired with a specific human or mouse FATP, they 
have been designated ceFATPa and ceFATPb. 

Example 1: Identification of Novel Mammalian FATPs 

The National Center for Biotechnology Information EST database was screened, 
using the mouse FATP protein sequence (mmFATPl), to identify novel FATPs. This 
strategy led to the identification of more than 50 murine EST sequences which could be 
assembled into five distinct contiguous DNA sequences (contigs). One contig was 
identical to the previously cloned FATP, which has been renamed FATP1. Another, 
which has been renamed FATP2, is the murine homologue of a rat gene previously 
identified by others as a very long chain acyl-CoA synthase (Uchiyama, A., Aoyama, 
T., Kamijo, K., Uchida, Y., Kondo, N., Orii, T. & Hashimoto, T. (1996) J. Biol. Chem. 
271:30360-30365). The other three contigs represented novel genes (FATP3, 4, and 5). 
Full-length clones for FATP2 and FATP5 and nearly complete sequences for FATP3 
and 4 (Figure 1) were obtained by screening cDNA libraries made from mouse day 
10.5 embryos and adult liver. Also identified were human homologues for each of the 
murine genes in the EST database. A sixth human gene was also identified; whether 
this gene is also present in the mouse will require additional studies. Map positions are 
given in Tables 2 and 3. 

The genetic loci for all of the human genes, with the exception of FATP5 which 
was already mapped as an unknown EST, were determined using the radiation hybrid 
panels. The map positions given below show the distance (in centiRays) from the 
closest framework marker. As a guideline, there are approximately 300 kb/cR. 



Table 2. Mapping Data for Human Genes 

hsFATP1  Chromosome Chr19 
         places 13.35 cR from WI-6344 (lod>3.0) 
hsFATP2  Chromosome Chr15 
         places 4.92 cR from D15S126 (lod>3.0) 
hsFATP3  Chromosome Chr1 
         places 13.24 cR from WI-2862 (lod>3.0) 
hsFATP4  Chromosome Chr9 
         places 7.80 cR from WI-9685 (lod>3.0) 
hsFATP5  unknown EST previously mapped to near D19S418 
hsFATP6  Chromosome Chr5 
         places 1.41 cR from WI-4907 (lod>3.0) 
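
Using the guideline stated above of approximately 300 kb per centiRay, the Table 2 distances can be converted to rough physical distances. The Python sketch below is only an order-of-magnitude illustration; the dictionary and function names are hypothetical, and the conversion factor is the approximate guideline from the text, not a measured value for each region.

```python
KB_PER_CR = 300.0  # approximate guideline from the text

# Gene: (closest framework marker, distance in cR), as listed in Table 2.
TABLE_2_DISTANCES_CR = {
    "hsFATP1": ("WI-6344", 13.35),
    "hsFATP2": ("D15S126", 4.92),
    "hsFATP3": ("WI-2862", 13.24),
    "hsFATP4": ("WI-9685", 7.80),
    "hsFATP6": ("WI-4907", 1.41),
}


def cr_to_kb(distance_cr, kb_per_cr=KB_PER_CR):
    """Convert a radiation hybrid distance in centiRays to approximate kilobases."""
    return distance_cr * kb_per_cr


for gene, (marker, distance_cr) in TABLE_2_DISTANCES_CR.items():
    print(f"{gene}: about {cr_to_kb(distance_cr):.0f} kb from {marker}")
```

hsFATP5 is omitted from the dictionary because Table 2 gives no cR distance for it.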

The mouse map is an internal backcross panel consisting of 188 mouse 
backcross DNA's plus 4 controls (B6, Spretus, F1, Water). The backcross was 
constructed by crossing B6 by Spretus animals and then crossing those F1's back to B6. 
Mapping is accomplished by taking advantage of recombinational events during 
meiosis, and the use of PCR primers to detect the differences (by size or re-annealing 
events) at any given locus between the B6 and Spretus allele. 

For the purposes of mapping, a novel set of primers (gene of interest) is used to 
amplify from all 188 DNA's and then typed as being a B6 ("B") or a Spretus ("S"). This 
string of B's and S's is entered into the Map Manager program, which does a best fit 
calculation by comparing the string of 188 typings from the gene of interest to all loci 
already extant in the panel, for all 20 chromosomes. The gene of interest is then 
assigned to a particular area on a particular chromosome according to a number of 
parameters, including the minimization of double cross-overs, and the highest LOD 
scores. Indicated in Table 3 are distances to the closest markers on either side of the 
FATP locus. 
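
The comparison of typing strings can be illustrated with the following Python sketch. It does not reproduce the Map Manager program; it uses the standard two-point backcross calculation (recombination fraction estimated as recombinants over informative animals, capped at 0.5, and a LOD score against the no-linkage hypothesis) as a stand-in. The typing strings, locus names and function names are hypothetical, and minimization of double cross-overs is not modelled.

```python
import math


def compare_typings(gene, locus):
    """Return (recombinants, informative animals, theta, LOD) for two typing strings.

    Each string holds one character per backcross animal: "B" or "S".
    Any other character is treated as an untyped animal and skipped.
    """
    if len(gene) != len(locus):
        raise ValueError("typing strings must cover the same animals")
    pairs = [(g, l) for g, l in zip(gene.upper(), locus.upper())
             if g in "BS" and l in "BS"]
    n = len(pairs)
    if n == 0:
        raise ValueError("no animals typed at both loci")
    r = sum(1 for g, l in pairs if g != l)
    theta = min(r / n, 0.5)  # recombination fraction, capped at no linkage

    def term(count, p):
        # Guard against log10(0) when a count is zero.
        return count * math.log10(p) if count else 0.0

    lod = term(r, theta) + term(n - r, 1.0 - theta) + n * math.log10(2.0)
    return r, n, theta, lod


def best_fit(gene, panel):
    """Return the name of the panel locus with the highest LOD score."""
    return max(panel, key=lambda name: compare_typings(gene, panel[name])[3])


# Hypothetical typings for ten animals (the panel in the text has 188).
gene_typing = "BBSSBSBSBB"
panel = {"locusA": "BBSSBSBSBS", "locusB": "SBSBBSSBSB"}
print(best_fit(gene_typing, panel), compare_typings(gene_typing, panel["locusA"]))
```

For closely linked loci the recombination percentage (100 times theta) approximates the distance in centimorgans; that approximation is an assumption of this sketch.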



Table 3. Mapping Data for Mouse Genes 

mmFATP1  Chromosome 8 
         places 2.82 cM from D8Mit132 (lod 43.4) and 1.81 cM from D8Mit74 
         (lod 43.5) 
mmFATP2  Chromosome 2 
         places 1.29 cM from D2Mit258 (lod 47.9) and 1.75 cM from D2NDS3 
         (lod 44.9) 
mmFATP3  Chromosome 3 
         places 2.54 cM from D3Mit22 (lod 29.5) and 19.62 cM from D3Mit42 
         (lod 13.6) 
mmFATP4  Chromosome 2 
         places 13.78 cM from D2Mit1 (lod 22.9) and 3.85 cM from D2Mit65 
         (lod 41.9) 
mmFATP5  Chromosome 7 
         places 7.28 cM proximal of D7Mit21 (lod 28.3) 



Example 2: Assessment of Function 

The ability of the newly identified mouse genes to function as fatty acid 
transporters was assessed using a fluorescence-activated cell sorting-based assay. COS 
cells were transiently cotransfected with expression vectors encoding the cell surface 
protein CD2 and either mmFATP1, mmFATP2, or mmFATP5, respectively. Two days 
after transfection, COS cells were stained with an antibody to CD2 and then incubated 
with a BODIPY-labeled fatty acid [BODIPY-FA, (Schaffer, J.E. & Lodish, H.F. (1994) 
Cell 79:427-436)]. The cells were then washed extensively, lifted off the dish, and 
analyzed by fluorescence-activated cell sorting. As judged by the number of 
CD2-positive cells, the transfection efficiency was approximately 20-30%. Fatty acid uptake 
was quantitated in the transiently transfected COS cells by measuring the BODIPY-FA 
fluorescence of the CD2-positive cells. Expression of CD2 had no effect on fatty acid 
uptake as shown by the finding that COS cells expressing only the transfected CD2 
cDNA (CD2-positive) had the same low level of BODIPY-FA uptake as did 
untransfected (CD2-negative) control cells (Figure 2A, control). In COS cells 
cotransfected with CD2 and mmFATP1, mmFATP2, or mmFATP5, uptake of 
BODIPY-FA by the transfected (CD2-positive) cells was increased 15- to 90-fold 
over control (CD2 cDNA only) cells (Figures 2A-2D). 

Example 3: Expression Patterns of Murine FATPs 

Expression patterns of members of the murine FATP gene family were
characterized by Northern blot analysis; to avoid cross-hybridization, the probes used
were from the 3' untranslated regions of these genes, which are less than 60% identical in
sequence. The expression pattern of FATP1 agrees with that previously found
(Schaffer, J.E. & Lodish, H.F. (1994) Cell 79:427-436). Here, expression was seen
primarily in heart and kidney. FATP2 is expressed almost exclusively in liver and
kidney, which corresponds to the reported tissue distribution of the rat homologue [very
long chain acyl-CoA synthetase (VLACS)] as assessed by Western blotting (Uchiyama, A.,
Aoyama, T., Kamijo, K., Uchida, Y., Kondo, N., Orii, T. & Hashimoto, T. (1996) J.
Biol. Chem. 271:30360-30365). FATP3 is present in lung, liver, and testis. FATP5 is
expressed only in liver and cannot be detected in other tissues even when the blot is
overexposed. The human homologue of FATP5 is also liver specific and is not
expressed in a wide array of other tissues tested, including fetal liver.

Example 4: FATPs Are Evolutionarily Conserved

The EST database was searched, using sequences conserved among the five
murine FATP genes, for FATP genes in other organisms. Two homologues were found
in C. elegans and one in M. tuberculosis. One of the C. elegans genes was cloned from
a cDNA library and expressed in COS cells, as described for the murine FATPs.
Overexpression of the nematode FATP resulted in a 15-fold increase of BODIPY-FA
uptake compared with control cells (Figure 3). The mycobacterial FATP gene was
isolated from a phage library and assessed for its ability to facilitate fatty acid uptake.
E. coli transformed with a prokaryotic, isopropyl β-D-thiogalactoside-inducible
expression vector containing the mycobacterial FATP gene demonstrated a significant
increase in the rate of [3H]palmitate uptake after induction, compared with uninduced
bacteria or E. coli transformed with a control protein (Figure 4). Novel FATP genes
were also identified in F. rubripes (puffer fish) and D. melanogaster.

Example 5: Phylogenetic Tree of FATPs 

Faergeman et al. (Faergeman, N.J., DiRusso, C.C., Elberger, A., Knudsen, J. &
Black, P.N. (1997) J. Biol. Chem. 272:8531-8538) identified three regions of very
strong conservation between the scFATP and mmFATP1 genes. The sequences of the
FATPs were compared over a 311-amino acid FATP "signature sequence" which
includes these conserved regions, corresponding to amino acids 246-557 in mmFATP1
(underlined in Figure 1). When compared with the National Center for Biotechnology
Information nonredundant database, only one region of the FATP "signature sequence"
shows significant homology to other proteins. This small stretch of amino acids
(underlined in Fig. 1) is an AMP-binding motif found in a multitude of other proteins,
such as acyl-CoA synthase, several CoA ligases, and gramicidin S synthetase
component II (Schaffer, J.E. & Lodish, H.F. (1994) Cell 79:427-436). The relevance of
this motif to fatty acid transport is unclear. Other highly conserved regions among the
FATPs, including long stretches of amino acids >90% identical from mycobacteria to
humans, are not found in any other class of proteins. A 48-amino acid segment of the
FATP signature sequence was used to construct a phylogenetic tree (Figure 5). Each of
the human and mouse genes form their own branch; hsFATP6, which as yet has no
murine homologue, is most closely related to hsFATP3 and mmFATP3. As expected,
rnVLACS is closer in sequence to mmFATP2 than to hsFATP2. The FATP genes of
invertebrates, i.e., C. elegans and D. melanogaster, are most closely related to each
other. Surprisingly, the mycobacterial gene is more closely related to the human and
mouse FATP5 genes than to the FATPs of any of the lower organisms. Whether this
reflects coevolution of the mycobacterial and human genes awaits further study.



Materials and Methods

The following materials and methods were used in the work described in
Examples 6-10.

Isolation of full-length human FATP1 and 4 

Full-length clones encoding human FATP1 and human FATP4 were identified
by searching databases for sequences similar to murine FATP1-5 coding regions using
the BlastX algorithm (Altschul et al., J. Mol. Biol. 215: 403-410, 1990).
A concatamer of nucleotide sequences comprising the coding sequences of
mmFATP1 (Genbank Accession U15976), mmFATP2, mmFATP3 (SEQ ID NO:6),
mmFATP4 (SEQ ID NO:8) and mmFATP5 (SEQ ID NO:10) was used to search the
Millennium database using the BLASTX algorithm. Sequences with a score >150 were
evaluated for whether they represented known FATP coding sequences.
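
The screening step just described (retain database hits scoring above 150, then check each against the FATP sequences already known) can be sketched as follows. This is a minimal illustration only and not the procedure actually used: the `Hit` record layout, the `known_fatp_ids` set, the function name and the example data are all hypothetical, and only the score threshold of 150 is taken from the text.

```python
from dataclasses import dataclass

SCORE_THRESHOLD = 150  # from the text: sequences with a score >150 were evaluated


@dataclass(frozen=True)
class Hit:
    """One database hit from a translated search (hypothetical record layout)."""
    subject_id: str
    score: float


def candidate_novel_hits(hits, known_fatp_ids, threshold=SCORE_THRESHOLD):
    """Return hits scoring above the threshold that are not already known FATPs,
    sorted from highest to lowest score."""
    kept = [
        h for h in hits
        if h.score > threshold and h.subject_id not in known_fatp_ids
    ]
    return sorted(kept, key=lambda h: h.score, reverse=True)


if __name__ == "__main__":
    # Made-up hits, for illustration only.
    hits = [
        Hit("cloneA", 420.0),
        Hit("cloneB", 150.0),   # not strictly above the threshold, so dropped
        Hit("cloneC", 310.5),
        Hit("knownFATP", 900.0),
    ]
    for hit in candidate_novel_hits(hits, known_fatp_ids={"knownFATP"}):
        print(hit.subject_id, hit.score)
```

In practice the comparison against known sequences would be made by sequence inspection rather than by identifier, so the set lookup here is a stand-in for that evaluation.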

Human clones with similarity to the 5' end of murine FATP sequences were
sequenced completely. Clones encoding full-length human FATP1 were obtained from
a heart cDNA library constructed in the mammalian expression vector pMET7
(Tartaglia et al., Cell 83: 1263-1271, 1995). Clones encoding full-length human
FATP4 were obtained from a spleen cDNA library constructed in the mammalian
expression vector pMET7.

Isolation of full-length human FATP6

Several clones encoding human FATP6 were identified by searching public
databases as described above. Five clones were analyzed further by restriction digestion
and DNA sequencing. One of these clones (Genbank Accession # AA412064) appeared
to be full-length and its entire insert was sequenced.

DNA Sequence Analysis 

Sequences were aligned with the DNAStar program using the Clustal method.
Hydrophobicity plots were generated with DNA Strider using the Kyte-Doolittle
method.
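
For readers unfamiliar with hydrophobicity plots, the Kyte-Doolittle method assigns each amino acid a hydropathy index and averages the index over a sliding window along the sequence. The following minimal sketch shows the calculation. It is not the DNA Strider implementation; the window length of 19 residues is an assumption (the window used is not stated in the text), and the function name is hypothetical.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=19):
    """Return the mean hydropathy of every full window along the sequence.

    The result has len(sequence) - window + 1 values. A residue outside the
    20 standard amino acids raises KeyError. The default window of 19 is an
    assumed value, not one given in the text.
    """
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    values = [KYTE_DOOLITTLE[aa] for aa in seq]
    total = sum(values[:window])
    profile = [total / window]
    # Slide the window one residue at a time, updating the running sum.
    for i in range(window, len(values)):
        total += values[i] - values[i - window]
        profile.append(total / window)
    return profile
```

Stretches where the windowed mean stays strongly positive are the usual candidates for membrane-spanning segments, which is how such plots support a prediction of multiple membrane-spanning domains.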

In situ hybridization

Tissues were collected from 8 week old C57/Bl6 mice. Tissues were fresh
frozen, cut on a cryostat at 10 µm thickness and mounted on Superfrost Plus slides
(VWR). Sections were air dried for 20 minutes and then incubated with ice cold 4%
paraformaldehyde (PFA)/phosphate buffered saline (PBS) for 10 minutes. Slides were
washed twice for 5 minutes each with PBS, incubated with 0.25% acetic anhydride/1 M
triethanolamine for 10 minutes, washed with PBS for 5 minutes and dehydrated with
70%, 80%, 95% and 100% ethanol for 1 minute each. Sections were incubated with
chloroform for 5 minutes. Hybridizations were performed with 35S-radiolabeled (5x10^7
cpm/ml) cRNA probes generated from the 3' untranslated regions of mouse FATPs by
PCR followed by in vitro transcription. Hybridization was carried out in the presence
of 50% formamide, 10% dextran sulfate, 1x Denhardt's solution, 600 mM NaCl, 10 mM DTT,
0.25% SDS and 10 µg/ml tRNA for 18 hours at 55°C. After hybridization, slides were
washed with 10 mM Tris-HCl pH 7.6, 500 mM NaCl, 1 mM EDTA (TNE) for 10 minutes,
incubated in 40 µg/ml RNase A in TNE at 37°C for 30 minutes, washed in TNE for 10 minutes,
incubated once in 2x SSC at 60°C for 1 hour, once in 0.2x SSC at 60°C for 1 hour,
once in 0.2x SSC at 65°C for 1 hour and dehydrated with 50%, 70%, 80%, 90% and
100% ethanol. Localization of mRNA transcripts was detected by dipping slides in
Kodak NTB-2 photoemulsion and exposing for 7 days at 4°C, followed by development
with Kodak Dektol developer. Slides were counterstained with haematoxylin and
eosin and photographed. Controls for the in situ hybridization experiments included the
use of a sense probe, which showed no signal above background in all cases.

Northern Blotting 

Human mRNA blots were obtained from Invitrogen or Clontech. PCR fragments
from the 3' untranslated regions of human FATPs were used as probes. Blots were
probed with 32P-labeled DNA probes using the Rapid-Hyb buffer (Amersham)
according to the manufacturer's instructions.

Cell transfection and LCFA uptake. COS cells were cotransfected, using
lipofectamine (GIBCO BRL) according to the manufacturer's instructions, with the
mammalian expression vector pCDNA3.1 (Invitrogen) expressing the gene for CD2 in
combination with a pMET7 expression vector (Tartaglia et al., Cell, 83:1263-1271,
1995) containing hsFATP1 (pMET7-hsFATP1) or hsFATP4 (pMET7-hsFATP4) or
pMET7 alone. Two days after transfection, cells were assayed for CD2 expression with
a phycoerythrin-coupled anti-CD2 (PE-CD2) monoclonal antibody (PharMingen), and
fatty acid uptake was assayed with a BODIPY-labeled fatty acid analog (Molecular
Probes) as described above.

Example 6: Determination of Expression of mmFATPs

mmFATP4, and to a lesser extent mmFATP2, are expressed at high levels in the
brush border layer of the small intestine.

Absorption of dietary fat requires transport of free fatty acids across the apical
membrane of epithelial cells in the small intestine. Previous studies suggested that this
transport is protein-mediated; however, the transport protein had not yet been identified.
In situ hybridization was performed on each of the three regions of the small intestine
(duodenum, jejunum and ileum) as well as the colon, using probes from the 3'
untranslated regions of mmFATP1, mmFATP2, mmFATP3, mmFATP4 and
mmFATP5, to determine whether any of the mouse FATPs are expressed in the small
intestine. It was expected that a protein involved in fatty acid absorption would be
expressed in the epithelial cells of the small intestine, but absent from the colon.

Expression of mmFATPs in the jejunum was identical to that in the ileum in all
cases. High levels of mmFATP4 mRNA were present in the epithelial cells of the
jejunum and ileum, and lower, but significant, amounts were detected in the epithelial
cells of the duodenum. Significantly, FATP4 mRNA was absent from other cell types
of the small intestine and no FATP4 mRNA could be detected in any of the cells of the
colon. FATP2 mRNA was present in the epithelial cells of the duodenum at a level
similar to that of FATP4, but was present at lower levels in the jejunum and ileum. No
signals above background were detected for mmFATP1, mmFATP3 and mmFATP5 in
any of the intestinal tissues. mmFATP3 and mmFATP5 were clearly detectable by in situ
hybridization in adult liver and mmFATP1 could be detected in a variety of tissues on a
whole embryo in situ, indicating that the FATP1, 3, and 5 probes were working.

mmFATP4 expression is predominant in the small intestine compared to the
other organs of the mouse embryo. In the small intestine, FATP4 expression is limited
to differentiated enterocytes, while no signal is detected in the connective tissue or the
undifferentiated epithelial cells in the crypts. Differentiated enterocytes are known to be
the cells that mediate the uptake of fatty acids. FATP4 is specifically and strongly
expressed in the epithelial cells of adult murine duodenum and ileum but not colon.
Other FATPs, such as FATP5, are not expressed in the small intestine. Thus, FATP4 is
the major FATP in the mouse small intestine. Given its high level of expression, it is
likely that FATP4, and to a lesser extent FATP2, play an important role in the
absorption of fatty acids.

mmFATP2 and mmFATP5 are expressed in hepatocytes

Northern analysis of mmFATP2, mmFATP3, mmFATP4 and mmFATP5
showed expression in the liver. To determine whether these proteins are present in
hepatocytes or other cell types present in liver homogenates, in situ hybridizations
were performed. mmFATP2 and mmFATP5 mRNAs were clearly present in
hepatocytes, and were not concentrated in other cell types such as endothelial cells or
macrophages. No signal above background was detected for mmFATP1 in any of the
cell types in the liver, consistent with the results of the Northern blotting.

Example 7: Isolation and Sequence Analysis of Full-length Human FATP1 and
Full-length Human FATP4

To identify human cDNA clones encoding FATP family members, Millennium
databases were searched for sequences similar to murine FATP1-5 coding regions. Two
clones were analyzed in detail; inspection of the entire DNA sequence of these two
clones showed that they encode the human orthologs of mmFATP1 and mmFATP4,
respectively. These two clones were designated hsFATP1 and hsFATP4, and their
DNA and predicted protein sequences are shown in Figures 44A-44C and 45, and
50A-50C and 51. hsFATP1 is predicted to encode a 646 amino acid, 71 kD protein with
multiple membrane-spanning domains (Figure 28A). HsFATP4 is predicted to encode a
643 amino acid, 72 kD protein with multiple membrane-spanning domains (see Figure
29A). A comparison of the DNA sequences of mouse and human FATP1 and mouse
and human FATP4 (Figures 30A-30B and 31A-31B) shows that the mouse and human
orthologs are 85% (FATP1) and 87% (FATP4) identical to each other within the coding
sequences given in these figures. At the amino acid level, hsFATP1 and hsFATP4 are
~90% identical to their respective mouse orthologs within the coding region shown in
these figures (Figures 32 and 33). The sequence identities between mouse and human
FATP1 and FATP4 are considerably higher than the ones observed between different
FATP family members within one species (~40%-60%) and are present in the
N-terminal part of the protein, a region that is poorly conserved between different FATP
family members. This high degree of sequence conservation clearly demonstrates that
the newly identified human FATPs are orthologs of mouse FATP1 and FATP4 rather
than novel FATP family members.

Table 4 is an identity/similarity matrix comparing the amino acid sequences of
FATP1 and 4 from human and mouse. This shows that the gene whose sequence is
shown in Figure 43A is indeed human FATP4, since it is 91% identical with the murine
FATP4 but only 62% identical with the closest related human FATP, which is FATP1.



Table 4

Identity/Similarity Matrix
(percent identity below the diagonal, percent similarity above the diagonal)

             hsFATP4    mmFATP4    hsFATP1    mmFATP1
hsFATP4         -         93.2       72.3       72.0
mmFATP4        91.0        -         71.2       71.1
hsFATP1        61.9       61.0        -         92.4
mmFATP1        60.7       59.6       89.5        -
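
To make the identity values in Table 4 concrete, the sketch below shows one common way of computing percent identity from a pair of sequences that have already been aligned. It is an illustration under stated assumptions and not the calculation performed by the alignment software used here: the choice of denominator (all aligned columns other than gap-gap columns) is an assumption, alignment programs differ on this point, and the function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns in which both sequences carry the same residue.

    Both inputs must be the same length (gaps already inserted). Columns that
    are gaps in both sequences are ignored; a column with a gap in only one
    sequence counts as a mismatch. This denominator is an assumed convention.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Toy alignment, not FATP sequence: 4 identical columns out of 6.
    print(round(percent_identity("ACDE-G", "ACDQFG"), 1))
```

A similarity score, as reported above the diagonal in such a matrix, additionally counts conservative substitutions as matches and therefore depends on the substitution scheme chosen.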





Example 8: Isolation and Sequence Analysis of Full-length Human FATP6 

A search of EST databases identified a set of overlapping human sequences that
were similar to FATPs, but did not have a clear mouse ortholog. One of these EST
clones was found to encode a full-length cDNA. The entire insert of this clone was
sequenced and designated hsFATP6. The DNA and predicted protein sequences of
hsFATP6 are shown in Figures 54A-54C and 55. HsFATP6 is predicted to encode a
619 amino acid, 70 kD protein with multiple membrane-spanning domains (Figure
35A). A comparison of the amino acid sequences of hsFATP6 with other human
FATPs shows about 37% identity to either hsFATP1 or hsFATP4 (Figure 36). This
degree of sequence identity is similar to what is observed between different mouse
FATPs. The phylogenetic analysis described above clearly demonstrates that hsFATP6
is a member of the FATP family, but not an ortholog of any of the mouse FATPs.
Comparisons were done with "ALIGN" (E. Myers and W. Miller, "Optimal Alignments
in Linear Space," CABIOS 4:11-17 (1988)) using standard settings.

Example 9: Tissue Distribution of Human FATPs 

The tissue distribution of human FATPs was assessed by Northern blotting.
Human FATP3 was expressed in a large variety of tissues. In contrast, human FATP5
was present at high levels in the liver, but was undetectable in all other tissues
examined. Thus, both hsFATP3 and hsFATP5 recapitulate the expression pattern of
their mouse orthologs (see above). HsFATP6 is a novel FATP with no mouse ortholog
as yet. Northern blotting shows that hsFATP6 is expressed at high levels in the heart,
but is undetectable in other tissues, including skeletal and smooth muscle. This tissue
distribution suggests that human FATP6 performs an important role in energy
metabolism in the heart; blocking FATP6-mediated fatty acid transport may therefore
be beneficial for a number of heart diseases, e.g., ischemic heart disease.

To identify the major FATP expressed in the human small intestine, Northern
blotting was performed on a blot containing mRNA from human stomach, jejunum,
ileum, colon, rectum and lung. hsFATP5 and hsFATP6 were undetectable in any of
these tissues. FATP5 is only expressed in liver and FATP6 only in heart. hsFATP2
was weakly expressed in the colon, and an even weaker signal was detectable in
jejunum, ileum and lung lanes. hsFATP3 was expressed well in the lung, but was only
weakly expressed in the other tissues tested. Importantly, no difference was seen in the
expression of hsFATP3 between small intestine and stomach or colon, suggesting that
the expression observed is not related to fatty acid absorption in the small intestine.
hsFATP4 was clearly expressed in both jejunum and ileum; expression was
significantly lower in the colon and was absent in the stomach. This expression pattern
is consistent with a major role for FATP4 in absorption of fatty acids in the human gut.

Example 10: Expression of hsFATP1 and hsFATP4 Promotes Transport of Fatty Acids

COS cells were cotransfected using lipofectamine with the mammalian
expression vector pCDNA-CD2 in combination with one of the FATP-containing
expression vectors (pMET7-hsFATP1 or pMET7-hsFATP4) or an insertless expression
vector (pMET7, control) as described in Materials and Methods for Examples 6-10.
COS cells were gated on forward scatter and side scatter. Cells exhibiting more than
400 CD2 fluorescence units, representing ~30% of all cells, were deemed CD2-positive.
The percent of CD2-positive cells exhibiting a BODIPY fluorescence of >300 is plotted
for the three different vectors tested (Figure 37).
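
The gating arithmetic described in this example can be sketched as follows. This is a minimal illustration and not the analysis software actually used: the representation of each cell as a (CD2 fluorescence, BODIPY fluorescence) pair, the function name and the example values are hypothetical. Only the two thresholds (more than 400 CD2 units, BODIPY fluorescence above 300) are taken from the text, and the events are assumed to have already passed the forward and side scatter gate.

```python
CD2_GATE = 400     # from the text: more than 400 CD2 fluorescence units
BODIPY_GATE = 300  # from the text: BODIPY fluorescence of >300


def percent_bodipy_high(events, cd2_gate=CD2_GATE, bodipy_gate=BODIPY_GATE):
    """Percent of CD2-positive cells whose BODIPY fluorescence exceeds the gate.

    `events` is an iterable of (cd2_fluorescence, bodipy_fluorescence) pairs
    for cells that already passed the scatter gate (hypothetical layout).
    """
    cd2_positive = [bodipy for cd2, bodipy in events if cd2 > cd2_gate]
    if not cd2_positive:
        raise ValueError("no CD2-positive cells in this sample")
    high = sum(1 for bodipy in cd2_positive if bodipy > bodipy_gate)
    return 100.0 * high / len(cd2_positive)


if __name__ == "__main__":
    # Made-up events, for illustration only.
    sample = [(500, 350), (450, 100), (100, 900), (401, 301)]
    print(round(percent_bodipy_high(sample), 1))
```

Restricting the calculation to CD2-positive cells is what allows the untransfected cells in the same dish to be excluded from the comparison between vectors.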

Example 11: Stable Expression of Human FATP4 in 293 Cells

Stable cell lines were generated as follows. A DNA fragment containing the
entire hsFATP4 coding sequence as well as 100 nucleotides of 5' and 50 nucleotides of
3' untranslated region was inserted into the vector pIRES-neo (Clontech) using standard
cloning techniques. The resulting construct or a vector control (pIRES-neo) was
transfected into 293 cells using the lipofectamine method (Gibco BRL) according to the
manufacturer's directions. Cells that had taken up the DNA were selected with 1 mg/ml
G418 (Gibco BRL). Single colonies were picked 1 to 2 weeks after transfection and
grown in medium containing 0.8 mg/ml G418. Colonies were screened for the ability to
take up fatty acids by measuring uptake of a fluorescently labeled fatty acid
(BODIPY-FA). About 40 colonies transfected with the pIRES-neo containing FATP4 and ~20
colonies transfected with pIRES-neo control were analyzed. All 20 of the vector control
clones showed amounts of BODIPY-FA uptake similar to each other and to
untransfected 293 cells. In contrast, among the 40 FATP4 transfected clones, 3 had a
5- to 10-fold increased BODIPY-FA uptake compared to any of the vector controls, and a
large number (~20) showed an approximately two-fold increase in BODIPY-FA levels.
This distribution is consistent with FATP4 conferring increased fatty acid uptake in
these cells. One of the cell lines with the highest amount of BODIPY-FA uptake was
selected to be used for measuring uptake of tritiated fatty acid.

The uptake of tritiated oleate by either FATP4-expressing or control
cells was assayed over time. Expression of FATP4 increases the rate of fatty acid
uptake by over 3-fold, demonstrating that FATP4 is, like the other FATPs, a functional
fatty acid transporter (Figure 38).
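
One way to express a fold increase in uptake rate from a time course is to fit a line to uptake versus time for each cell line and take the ratio of the slopes. The sketch below illustrates this under explicit assumptions: the text does not state how the rate was calculated, uptake is assumed to be approximately linear over the interval measured, and the function names are hypothetical.

```python
def uptake_rate(times, uptake):
    """Least-squares slope of uptake versus time (uptake units per time unit)."""
    n = len(times)
    if n != len(uptake) or n < 2:
        raise ValueError("need at least two paired measurements")
    mean_t = sum(times) / n
    mean_u = sum(uptake) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (u - mean_u) for t, u in zip(times, uptake))
    return sxy / sxx


def fold_increase(times, uptake_test, uptake_control):
    """Ratio of the fitted uptake rate in test cells to that in control cells."""
    return uptake_rate(times, uptake_test) / uptake_rate(times, uptake_control)
```

If uptake saturates within the measured interval, only the early, approximately linear time points should be used for the fit.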

Example 12: Immuno-staining with FATP4-Specific Antiserum

A polyclonal antiserum against the C-terminus of mmFATP4 was raised using a
GST-fusion protein having mmFATP4-specific amino acid sequence 552-643
(AVASP...GEEKL). In western blot experiments, the purified antibody reacted strongly
with a synthetic peptide matching the C-terminus of mmFATP4, but not with a
corresponding region of mmFATP2, mmFATP3, or mmFATP5. The mmFATP4-specific
polyclonal antiserum detects, in western blot experiments with enterocyte
lysates from 3 different mice, a ~70 kDa protein, which is in accordance with
mmFATP4's predicted molecular weight of 72 kDa. The binding is specific for
mmFATP4, since it can be completely abolished by preincubation of the antiserum with
the GST-fusion peptide used to raise the antibody.

Immunofluorescence experiments were performed using the anti-mmFATP4
antiserum on fresh frozen sections of murine small intestine. The antibody binding
demonstrates strong expression of mmFATP4 in enterocytes, confirming the results of
the in situ hybridization experiments. At higher magnifications it is apparent that
mmFATP4 is expressed at the apical side of the enterocyte, indicating that the
transporter is present in the brush border membrane, which is known to mediate the
uptake of fatty acids from the intestinal lumen.

Immuno-electron microscopy studies were performed on fresh frozen murine
intestinal cells. The gold particles used, appearing as black specks on the electron
micrographs, indicate the subcellular localization of mmFATP4 to be on the microvilli
of the enterocyte. It can be seen from the electron micrographs that mmFATP4 is
localized exclusively in membranes, preferentially the apical plasma membrane,
confirming that it is indeed a membrane protein.

Example 13: Inhibition of Fatty Acid Uptake Specific to FATP4 Demonstrated in 
Isolated Mouse Enterocytes 

Phosphorothioate derivatives of the following oligonucleotides were
synthesized:

FATP4-AS2     CCCCCACCAGAGAGGCTCC (SEQ ID NO:100)
FATP4-AS2MM   CCACCCCCGGAAAGCCTGC (SEQ ID NO:101)
FATP4-S2      GGAGCCTCTCTGGTGGGGG (SEQ ID NO:102)

FATP4-AS2 is the antisense oligo; it is designed to be complementary to the sequence
extending from nucleotide 10 to nucleotide 28 of the mouse FATP4 coding sequence.
FATP4-AS2MM is a control oligo; in this oligo every third nucleotide was changed,
creating mismatches; the overall nucleotide composition is identical to FATP4-AS2
(same number of G, A, T, C). FATP4-S2 is the sense control.
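
The stated relationships among the three oligonucleotides can be checked directly from their sequences. The short sketch below is offered as an illustration; the function names are hypothetical, and the sequences are those listed above with any whitespace inside them removed. It tests that the antisense oligo is the reverse complement of the sense oligo, that the mismatch control has the same base composition as the antisense oligo, and which positions differ between the two.

```python
from collections import Counter

# Sequences as listed in the text (whitespace inside a sequence removed).
FATP4_AS2 = "CCCCCACCAGAGAGGCTCC"    # antisense
FATP4_AS2MM = "CCACCCCCGGAAAGCCTGC"  # mismatch control
FATP4_S2 = "GGAGCCTCTCTGGTGGGGG"     # sense control

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def mismatch_positions(a, b):
    """Return the 1-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return [i for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


if __name__ == "__main__":
    print(reverse_complement(FATP4_S2) == FATP4_AS2)      # antisense vs sense
    print(Counter(FATP4_AS2) == Counter(FATP4_AS2MM))     # same composition
    print(mismatch_positions(FATP4_AS2, FATP4_AS2MM))     # where the control differs
```

Run on the listed sequences, the mismatches fall at positions 3, 6, 9, 12, 15 and 18, which agrees with the description that every third nucleotide was changed.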

Enterocytes were isolated from the small intestine of mice and incubated for 48 h
in tissue culture (Figure 40) either without oligonucleotides (squares) or with 100 µM
FATP4 specific sense (circles) or antisense (diamonds) oligonucleotides. The uptake
over time of 25 µM oleate was then measured. While the FATP4 sense oligonucleotide
did not significantly influence the uptake, the antisense oligonucleotide inhibited fatty
acid uptake by ~50%.

The effect of either FATP4 sense, antisense or mismatch sequence
oligonucleotides on the uptake of fatty acids was measured in enterocytes. Isolated
enterocytes were incubated with increasing concentrations of FATP4 antisense
oligonucleotides (solid bars in Figure 41), or a mismatch control oligonucleotide with
identical nucleotide composition (stippled bars), or with 100 µM of the FATP4 sense
oligonucleotide (lined bar). The medium for this incubation was Dulbecco's modified
Eagle's medium with 4.5 g/L glucose, 1 mM sodium pyruvate, 0.01 mg/ml human
transferrin and 10% fetal bovine serum. After 48 hours of incubation the uptake of
oleate by enterocytes was measured over a 5 minute time interval. Measurements were
done in quadruplicate. The uptake assay was done in Hank's buffered salt solution with
10 mM taurocholate. Only the enterocytes given the FATP4 antisense oligonucleotide
showed a concentration-dependent decrease in fatty acid uptake, which was inhibited by
~50% at a concentration of 100 µM. This effect was FATP4 specific: uptake was inhibited
only by the antisense oligonucleotide, which can bind to the FATP4 mRNA and block its
translation, and not by a control oligonucleotide differing only in sequence but not in
nucleotide content. This rules out a toxic or otherwise nonspecific inhibitory effect of
the oligonucleotide due to its chemical composition.

As a further control experiment, the uptake of oleate was measured along with
the uptake of methionine in the same cultured enterocytes. Antisense oligonucleotide,
mismatch sequence oligonucleotide, or no oligonucleotide was added to a concentration
of 100 µM to cultures of enterocytes. After incubation for 48 hours, the uptake of both
3H-labeled oleate and 35S-labeled methionine was assayed. Results are shown in Figure
42. Fatty acid uptake is at the left side of the paired bars; methionine uptake is on the
right side of the paired bars. The fact that amino acid uptake was not influenced by the
antisense oligonucleotide treatment further supports the conclusion that the antisense
oligonucleotide causes a specific reduction in translation of FATP4-specific mRNA.

Example 14: mmFATP2 Is Expressed in Proximal Renal Tubule Epithelium

Northern analysis showed that mmFATP1, mmFATP2, and mmFATP4 are
present in the kidney. In situ hybridization (methods as for Example 6) was performed
to determine which cell type(s) of the kidney these mRNAs are expressed in.
mmFATP1 mRNA was present in virtually all cells throughout the kidney with no
obvious preference for a particular cell type. In contrast, mmFATP2 was expressed
only in the renal cortex. Within the cortex, expression of mmFATP2 was restricted to
the epithelial cells of the proximal renal tubules. The primary function of proximal
renal tubule cells is the reabsorption of filtered salts and nutrients (e.g., glucose), a
process that requires mitochondrial oxidation and that can utilize fatty acids as energy
substrates. Based on the localization of mmFATP2, it is possible that mmFATP2 is
important for reabsorption in the kidney by allowing uptake of an energy source (fatty
acids) from the blood into renal epithelial cells. Alternatively, if fatty acids need to be
reabsorbed in the kidney, similarly to glucose, FATP2 could be involved in the
reabsorption of fatty acids. Determination of the subcellular localization of FATP2 will
distinguish between these two possibilities.

Table 5 summarizes data on expression of the mouse FATPs in various organs.

Example 15: Isolation of full-length human FATP3

Full-length clones encoding human FATP3 were identified by searching
databases for sequences similar to the murine FATP1-5 coding regions using the BlastX
algorithm (Altschul et al., J. Mol. Biol. 215: 403-410, 1990). Human clones with
similarity to the 5' end of murine FATP sequences were sequenced completely. A clone
encoding full-length human FATP3 was obtained from a human bone library
constructed in the mammalian expression vector pMET7 (Tartaglia, L.A. et al., Cell 83:
1263-1271, 1995). To identify human cDNA clones encoding FATP family members,
databases were searched for sequences similar to murine FATP1-5 coding regions. One
clone was found to encode the human ortholog of mmFATP3 and was designated
hsFATP3. The DNA and predicted protein sequences of hsFATP3 are shown in Figures
94A and 94B. hsFATP3 is predicted to encode a 703 amino acid, 75.6 kD protein with
multiple membrane-spanning domains. A comparison of the DNA sequences of mouse
and human FATP3 shows that the mouse and human orthologs are 81% identical to
each other within the coding region. At the amino acid level, hsFATP3 is ~86%
identical to mmFATP3 within the coding region. The sequence identities between
mouse and human FATP3 are considerably higher than those observed between
different FATP family members within one species (~40%) and are present in the
N-terminal part of the protein, a region that is poorly conserved between different FATP
family members.

All references cited herein are incorporated by reference in their entirety.
While this invention has been particularly shown and described with references
to preferred embodiments thereof, it will be understood by those skilled in the art that
various changes in form and details may be made therein without departing from the
spirit and scope of the invention as defined by the appended claims.

CLAIMS



What is claimed is: 



1. An isolated nucleic acid molecule comprising a nucleotide sequence selected
from the group consisting of:

a) a nucleotide sequence which encodes a protein consisting of the amino
acid sequence of FATP2 in SEQ ID NO:49;

b) a nucleotide sequence which encodes a protein consisting of the amino
acid sequence of FATP4 in SEQ ID NO:53; and

c) a nucleotide sequence which encodes a protein consisting of the amino
acid sequence of FATP6 in SEQ ID NO:57.

2. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) the nucleotide sequence in SEQ ID NO:48; 

b) the nucleotide sequence in SEQ ID NO:52; and 

c) the nucleotide sequence in SEQ ID NO:56.

3. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which is complementary to the nucleotide 
sequence of FATP2 in SEQ ID NO:48; 

b) a nucleotide sequence which is complementary to the nucleotide
sequence of FATP4 in SEQ ID NO:52; and

c) a nucleotide sequence which is complementary to the nucleotide
sequence of FATP6 in SEQ ID NO:56.

4. An isolated nucleic acid molecule comprising a nucleotide sequence selected
from the group consisting of: 

a) a nucleotide sequence which consists of the coding region of FATP2; 

b) a nucleotide sequence which consists of the coding region of FATP4; 
and 

c) a nucleotide sequence which consists of the coding region of FATP6. 



5. An isolated nucleic acid molecule comprising at least 15 contiguous nucleotides 
of a nucleotide sequence selected from the group consisting of:

a) SEQ ID NO:48, or of the complement thereof;

b) SEQ ID NO:52, or of the complement thereof; and

c) SEQ ID NO:56, or of the complement thereof.

6. An isolated nucleic acid molecule comprising a nucleotide sequence which
encodes a contiguous portion of at least about 15 amino acids of a sequence
selected from the group consisting of SEQ ID NO:48, SEQ ID NO:52, and SEQ
ID NO:56.

7. An isolated nucleic acid molecule comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid 
molecule hybridizes under high stringency conditions to a complement of a 

sequence selected from the group consisting of SEQ ID NO:48, SEQ ID NO:52,
and SEQ ID NO:56.

8. An isolated nucleic acid molecule having at least 90% nucleotide sequence
identity to a nucleic acid encoding a polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:49, SEQ ID NO:53,
and SEQ ID NO:57.

9. An isolated nucleic acid molecule comprising a nucleotide sequence encoding a 
polypeptide, wherein said nucleotide sequence is at least 95% similar to the 

nucleotide sequence of a nucleotide sequence selected from the group consisting
of SEQ ID NO:48, SEQ ID NO:52, and SEQ ID NO:56.

10. An isolated nucleic acid encoding a fatty acid transport protein having an amino 
acid sequence sharing at least about 95% amino acid sequence similarity with an 
amino acid sequence selected from the group consisting of SEQ ID NO:49, SEQ
ID NO:53, and SEQ ID NO:57.

11. An isolated nucleic acid molecule encoding a fusion polypeptide, said nucleic 
acid molecule comprising a nucleotide sequence encoding a portion of an amino 
acid sequence selected from the group consisting of SEQ ID NO:49, SEQ ID 
NO:53, and SEQ ID NO:57, and further comprising a nucleotide sequence
encoding a heterologous portion of said fusion polypeptide.

12. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which encodes a protein comprising the amino
acid sequence of FATP2 in SEQ ID NO:49;

b) a nucleotide sequence which encodes a protein comprising the amino
acid sequence of FATP4 in SEQ ID NO:53; and

c) a nucleotide sequence which encodes a protein comprising the amino
acid sequence of FATP6 in SEQ ID NO:57.

13. A vector comprising a nucleic acid comprising a nucleotide sequence selected
from the group consisting of: 

a) the nucleotide sequence of FATP2 in SEQ ID NO:48; 

b) the nucleotide sequence of FATP4 in SEQ ID NO:52; and

c) the nucleotide sequence of FATP6 in SEQ ID NO:56.

14. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which is complementary to the nucleotide 
sequence of FATP2 in SEQ ID NO:48; 
b) a nucleotide sequence which is complementary to the nucleotide
sequence of FATP4 in SEQ ID NO:52; and

c) a nucleotide sequence which is complementary to the nucleotide 
sequence of FATP6 in SEQ ID NO:56. 

15. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of:

a) a nucleotide sequence which consists of the coding region of FATP2; 

b) a nucleotide sequence which consists of the coding region of FATP4; 
and 

c) a nucleotide sequence which consists of the coding region of FATP6. 



16. A host cell comprising the vector of Claim 15.



17. An isolated nucleic acid molecule comprising a nucleotide sequence which
encodes a naturally occurring allelic variant of a polypeptide consisting of the
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid
molecule hybridizes under high stringency conditions to a complement of a
nucleic acid molecule consisting of a sequence selected from the group
consisting of SEQ ID NO:48, SEQ ID NO:52, and SEQ ID NO:56.

18. A vector comprising a nucleic acid comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid
molecule hybridizes under high stringency conditions to a complement of a
nucleic acid molecule consisting of a sequence selected from the group 
consisting of SEQ ID NO:48, SEQ ID NO:52, and SEQ ID NO:56. 



19. A host cell comprising the vector of Claim 18.

20. A method for producing a polypeptide which is a naturally occurring allelic
variant of a polypeptide consisting of the amino acid sequence of a fatty acid
transport protein, said method comprising culturing the host cell of Claim 19
under conditions in which the nucleic acid molecule is expressed, thereby 
producing the polypeptide. 

21. A vector comprising a nucleic acid having at least 90% nucleotide sequence
identity to a nucleic acid encoding a polypeptide consisting of an amino acid
sequence selected from the group consisting of SEQ ID NO:49, SEQ ID NO:53,
and SEQ ID NO:57.

22. A host cell comprising the vector of Claim 21.



23. A method for producing a polypeptide, said method comprising culturing the
host cell of Claim 22 under conditions in which the nucleic acid molecule is
expressed, thereby producing the polypeptide.

24. A vector comprising a nucleic acid encoding a fatty acid transport protein having
an amino acid sequence sharing at least about 95% amino acid sequence 
similarity with an amino acid sequence selected from the group consisting of 
SEQ ID NO:49, SEQ ID NO:53, and SEQ ID NO:57. 

25. A host cell comprising the vector of Claim 24.

26. A method for producing a fatty acid transport protein, said method comprising
culturing the host cell of Claim 25 under conditions in which the nucleic acid
molecule is expressed, thereby producing the fatty acid transport protein. 

27. A vector comprising a nucleic acid encoding a fusion polypeptide, said nucleic
acid comprising a nucleotide sequence which encodes a contiguous portion of at
least about 15 amino acids of a sequence selected from the group consisting of
SEQ ID NO:49, SEQ ID NO:53, and SEQ ID NO:57. 

28. A host cell comprising the vector of Claim 27. 

29. A method for producing a fusion polypeptide, said method comprising culturing
the host cell of Claim 28 under conditions in which the nucleic acid is expressed,
thereby producing the fusion polypeptide.

30. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which encodes a protein consisting of the amino
acid sequence of FATP1 in SEQ ID NO:47;

b) a nucleotide sequence which encodes a protein consisting of the amino
acid sequence of FATP3 in SEQ ID NO:51; and

c) a nucleotide sequence which encodes a protein consisting of the amino
acid sequence of FATP5 in SEQ ID NO:102.

31. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) the nucleotide sequence in SEQ ID NO:46;

b) the nucleotide sequence in SEQ ID NO:50; and

c) the nucleotide sequence in SEQ ID NO:101.

32. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which is complementary to the nucleotide
sequence of FATP1 in SEQ ID NO:46;

b) a nucleotide sequence which is complementary to the nucleotide
sequence of FATP3 in SEQ ID NO:50; and

c) a nucleotide sequence which is complementary to the nucleotide
sequence of FATP5 in SEQ ID NO:101.

33. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which consists of the coding region of FATP1;

b) a nucleotide sequence which consists of the coding region of FATP3;
and

c) a nucleotide sequence which consists of the coding region of FATP5.



34. An isolated nucleic acid molecule comprising at least 15 contiguous nucleotides
of a nucleotide sequence selected from the group consisting of:

a) SEQ ID NO:46, or of the complement thereof;

b) SEQ ID NO:50, or of the complement thereof; and

c) SEQ ID NO:101, or of the complement thereof.

35. An isolated nucleic acid molecule comprising a nucleotide sequence which
encodes a contiguous portion of at least about 15 amino acids of a sequence
selected from the group consisting of SEQ ID NO:47, SEQ ID NO:51, and SEQ
ID NO:102.

36. An isolated nucleic acid molecule comprising a nucleotide sequence which
encodes a naturally occurring allelic variant of a polypeptide consisting of the
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid
molecule hybridizes under high stringency conditions to a complement of a
sequence selected from the group consisting of SEQ ID NO:46, SEQ ID NO:50,
and SEQ ID NO:101.

37. An isolated nucleic acid molecule having at least 90% nucleotide sequence
identity to a nucleic acid encoding a polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:47, SEQ ID NO:51,
and SEQ ID NO:102.

38. An isolated nucleic acid molecule comprising a nucleotide sequence encoding a
polypeptide, wherein said nucleotide sequence is at least 90% identical to the
nucleotide sequence of a nucleotide sequence selected from the group consisting
of SEQ ID NO:46, SEQ ID NO:50, and SEQ ID NO:101, and wherein said
percent identity is calculated using the GAP program in the GCG software
package, using a gap weight of 5.000 and a length weight of 0.100.

39. An isolated nucleic acid encoding a fatty acid transport protein having an amino
acid sequence sharing at least about 95% amino acid sequence similarity with an 
amino acid sequence selected from the group consisting of SEQ ID NO:47, SEQ 
ID NO:51, and SEQ ID NO:102.

40. An isolated nucleic acid molecule encoding a fusion polypeptide, said nucleic
acid molecule comprising a nucleotide sequence encoding a portion of an amino
acid sequence selected from the group consisting of SEQ ID NO:47, SEQ ID
NO:51, and SEQ ID NO:102, and further comprising a nucleotide sequence
encoding a heterologous portion of said fusion polypeptide.

41. A vector comprising a nucleic acid comprising a nucleotide sequence selected
from the group consisting of: 

a) a nucleotide sequence which encodes a protein comprising the amino 
acid sequence of FATP1 in SEQ ID NO:47; 

b) a nucleotide sequence which encodes a protein comprising the amino
acid sequence of FATP3 in SEQ ID NO:51; and

c) a nucleotide sequence which encodes a protein comprising the amino
acid sequence of FATP5 in SEQ ID NO:102.

42. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 
a) the nucleotide sequence of FATP1 in SEQ ID NO:46;

b) the nucleotide sequence of FATP3 in SEQ ID NO:50; and

c) the nucleotide sequence of FATP5 in SEQ ID NO:101.



43. A vector comprising a nucleic acid comprising a nucleotide sequence selected
from the group consisting of:

a) a nucleotide sequence which is complementary to the nucleotide
sequence of FATP1 in SEQ ID NO:46;

b) a nucleotide sequence which is complementary to the nucleotide
sequence of FATP3 in SEQ ID NO:50; and

c) a nucleotide sequence which is complementary to the nucleotide
sequence of FATP5 in SEQ ID NO:101.

44. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which consists of the coding region of FATP1;

b) a nucleotide sequence which consists of the coding region of FATP3;
and

c) a nucleotide sequence which consists of the coding region of FATP5.



45. A host cell comprising the vector of Claim 44. 

46. An isolated nucleic acid molecule comprising a nucleotide sequence which
encodes a naturally occurring allelic variant of a polypeptide consisting of the
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid
molecule hybridizes under high stringency conditions to a complement of a
nucleic acid molecule consisting of a sequence selected from the group
consisting of SEQ ID NO:46, SEQ ID NO:50, and SEQ ID NO:101.



47. A vector comprising a nucleic acid comprising a nucleotide sequence which
encodes a naturally occurring allelic variant of a polypeptide consisting of the
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid
molecule hybridizes under high stringency conditions to a complement of a
nucleic acid molecule consisting of a sequence selected from the group
consisting of SEQ ID NO:46, SEQ ID NO:50, and SEQ ID NO:101.

48. A host cell comprising the vector of Claim 47. 

49. A method for producing a polypeptide which is a naturally occurring allelic
variant of a polypeptide consisting of the amino acid sequence of a fatty acid
transport protein, said method comprising culturing the host cell of Claim 48
under conditions in which the nucleic acid molecule is expressed, thereby 
producing the polypeptide. 

50. A vector comprising a nucleic acid having at least 90% nucleotide sequence
identity to a nucleic acid encoding a polypeptide consisting of an amino acid
sequence selected from the group consisting of SEQ ID NO:47, SEQ ID NO:51,
and SEQ ID NO:102.

51. A host cell comprising the vector of Claim 50.

52. A method for producing a polypeptide, said method comprising culturing the
host cell of Claim 51 under conditions in which the nucleic acid molecule is
expressed, thereby producing the polypeptide.

53. A vector comprising a nucleic acid encoding a fatty acid transport protein having 
an amino acid sequence sharing at least about 95% amino acid sequence 
similarity with an amino acid sequence selected from the group consisting of
SEQ ID NO:47, SEQ ID NO:51, and SEQ ID NO:102.

54. A host cell comprising the vector of Claim 53.

55. A method for producing a fatty acid transport protein, said method comprising
culturing the host cell of Claim 54 under conditions in which the nucleic acid 
molecule is expressed, thereby producing the fatty acid transport protein. 

56. A vector comprising a nucleic acid encoding a fusion polypeptide, said nucleic
acid comprising the nucleotide sequence which encodes a contiguous portion of
at least about 15 amino acids of a sequence selected from the group consisting of
SEQ ID NO:47, SEQ ID NO:51, and SEQ ID NO:102, said nucleic acid further
comprising a nucleotide sequence encoding a heterologous portion of said fusion
polypeptide.

57. A host cell comprising the vector of Claim 56.

58. A method for producing a fusion polypeptide, said method comprising culturing 
the host cell of Claim 57 under conditions in which the nucleic acid is expressed, 
thereby producing the fusion polypeptide. 

59. Isolated FATP2 or a functional portion thereof. 

60. An isolated polypeptide comprising an amino acid sequence which is at least
90% identical to the amino acid sequence of SEQ ID NO:49. 

61. An isolated polypeptide comprising an amino acid sequence which is at least 
95% identical to the amino acid sequence of SEQ ID NO:49. 

62. An isolated polypeptide comprising an amino acid sequence which is at least
97% identical to the amino acid sequence of SEQ ID NO:49.

63. Isolated polypeptide encoded by a nucleic acid molecule comprising a
nucleotide sequence which encodes a naturally occurring allelic variant of a
polypeptide consisting of the amino acid sequence of FATP2, wherein said
nucleic acid molecule hybridizes to a complement of a nucleic acid molecule
consisting of SEQ ID NO:48 under high stringency conditions.

64. An isolated polypeptide comprising an amino acid sequence in SEQ ID NO:49. 

65. A fusion protein comprising a polypeptide or peptide selected from the group 
consisting of: 

a) a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of FATP2 in SEQ ID NO:49;

b) a polypeptide consisting of an amino acid sequence which is at least 95% 
identical to the amino acid sequence of SEQ ID NO:49; 

c) a polypeptide consisting of an amino acid sequence in SEQ ID NO:49; 
and 

d) a peptide comprising a contiguous portion of at least about 15 amino acid
residues of any of the foregoing.

66. The fusion protein of Claim 65 wherein the fusion protein transports fatty acids 
across a cell membrane or an artificial cell membrane system. 

67. The fusion protein of Claim 65, further comprising an affinity ligand.

68. Isolated FATP4 or a functional portion thereof.

69. An isolated polypeptide comprising an amino acid sequence which is at least
90% identical to the amino acid sequence of SEQ ID NO:53.

70. An isolated polypeptide comprising an amino acid sequence which is at least
95% identical to the amino acid sequence of SEQ ID NO:53.

71. An isolated polypeptide comprising an amino acid sequence which is at least 
97% identical to the amino acid sequence of SEQ ID NO:53. 

72. Isolated polypeptide encoded by a nucleic acid molecule comprising a
nucleotide sequence which encodes a naturally occurring allelic variant of a
polypeptide consisting of the amino acid sequence of FATP4, wherein said 
nucleic acid molecule hybridizes to a complement of a nucleic acid molecule 
consisting of SEQ ID NO:52 under high stringency conditions. 

73. An isolated polypeptide comprising an amino acid sequence in SEQ ID NO:53.

74. A fusion protein comprising a polypeptide or peptide selected from the group 
consisting of: 

a) a naturally occurring allelic variant of a polypeptide consisting of the
amino acid sequence of FATP4 in SEQ ID NO:53;

b) a polypeptide consisting of an amino acid sequence which is at least 95%
identical to the amino acid sequence of SEQ ID NO:53;

c) a polypeptide consisting of an amino acid sequence in SEQ ID NO:53;
and

d) a peptide comprising a contiguous portion of at least about 15 amino acid
residues of any of the foregoing.



75. The fusion protein of Claim 74 wherein the fusion protein transports fatty acids
across a cell membrane or an artificial cell membrane system.

76. The fusion protein of Claim 74, further comprising an affinity ligand.

77. Isolated FATP6 or a functional portion thereof. 

78. An isolated polypeptide comprising an amino acid sequence which is at least 
90% identical to the amino acid sequence of SEQ ID NO:57.

79. An isolated polypeptide comprising an amino acid sequence which is at least
95% identical to the amino acid sequence of SEQ ID NO:57. 

80. An isolated polypeptide comprising an amino acid sequence which is at least 
97% identical to the amino acid sequence of SEQ ID NO:57. 

81. Isolated polypeptide encoded by a nucleic acid molecule comprising a
nucleotide sequence which encodes a naturally occurring allelic variant of a
polypeptide consisting of the amino acid sequence of FATP6, wherein said
nucleic acid molecule hybridizes to a complement of a nucleic acid molecule 
consisting of SEQ ID NO:56 under high stringency conditions. 

82. An isolated polypeptide comprising an amino acid sequence in SEQ ID NO:57. 

83. A fusion protein comprising a polypeptide or peptide selected from the group
consisting of: 

a) a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of FATP6 in SEQ ID NO:57; 

b) a polypeptide consisting of an amino acid sequence which is at least 95%
identical to the amino acid sequence of SEQ ID NO:57;

c) a polypeptide consisting of an amino acid sequence in SEQ ID NO:57;
and

d) a peptide comprising a contiguous portion of at least about 15 amino acid
residues of any of the foregoing.

84. The fusion protein of Claim 83 wherein the fusion protein transports fatty acids
across a cell membrane or an artificial cell membrane system. 

85. The fusion protein of Claim 83, further comprising an affinity ligand. 



86. A method for identifying an agent which binds to a protein comprising an amino
acid sequence of SEQ ID NO:49 or SEQ ID NO:53, comprising the steps of
contacting the agent with the isolated protein under conditions appropriate for
binding of the agent to the isolated protein, and detecting a resulting
agent-protein complex.



87. The method of Claim 86 wherein the step of contacting the agent with isolated 
protein is performed in an artificial membrane system.

88. The method of Claim 86 wherein the isolated protein is in isolated plasma
membrane. 

89. A method for identifying an agent which inhibits interaction between an isolated 
protein comprising amino acid sequence SEQ ID NO:49, or SEQ ID NO:53, and 
further comprising a ligand of said protein, comprising: 
(a) combining:

(1) said isolated protein;

(2) the ligand of said protein; and

(3) a candidate agent to be assessed for its ability to inhibit
interaction between said protein of (1) and the ligand of (2),
under conditions appropriate for interaction between the said
protein of (1) and the ligand of (2);

(b) determining the extent to which said protein of (1) and the ligand of (2)
interact; and

(c) comparing the extent determined in (b) with the extent to which
interaction of said protein of (1) and the ligand of (2) occurs in the
absence of the candidate agent to be assessed and under the same
conditions appropriate for interaction of said protein of (1) with the
ligand of (2);

wherein if the extent to which interaction of said protein of (1) and the ligand of
(2) occurs is less in the presence of the candidate agent than in the absence of the
candidate agent, the candidate agent is an agent which inhibits interaction
between said protein and the ligand of said protein.
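
The comparison in steps (b) and (c) of Claim 89 can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, and the "extent" of interaction is assumed to be a single non-negative number (for example a binding signal in arbitrary units), which the claim itself does not specify.

```python
# Illustrative sketch only; function names and the numeric "extent" are assumptions.

def inhibits_interaction(extent_with_agent: float, extent_without_agent: float) -> bool:
    """Apply the 'wherein' test: the agent counts as an inhibitor if the
    interaction is less in its presence than in its absence."""
    if extent_with_agent < 0 or extent_without_agent < 0:
        raise ValueError("interaction extents must be non-negative")
    return extent_with_agent < extent_without_agent


def percent_reduction(extent_with_agent: float, extent_without_agent: float) -> float:
    """Express the drop in interaction as a percentage of the no-agent value."""
    if extent_without_agent <= 0:
        raise ValueError("the no-agent extent must be positive")
    return 100.0 * (extent_without_agent - extent_with_agent) / extent_without_agent


# Made-up readings in arbitrary units, same conditions with and without the agent.
print(inhibits_interaction(40.0, 100.0), percent_reduction(40.0, 100.0))
```

The claim requires only that the extent be less in the presence of the agent; the percentage helper is an added convenience and not part of the claimed method.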

90. The method of Claim 89 wherein (a) is performed in an artificial membrane 
system. 

91. The method of Claim 89 wherein said isolated protein is in isolated plasma
membrane. 

92. A method for identifying an agent which binds to a protein, said protein encoded
by (1) a polynucleotide comprising a nucleotide sequence which encodes a
naturally occurring allelic variant of a polypeptide consisting of the amino acid
sequence of FATP2, wherein said polynucleotide hybridizes to a complement of
a polynucleotide consisting of SEQ ID NO:48 under high stringency conditions,
or by (2) a polynucleotide comprising a nucleotide sequence which encodes a
naturally occurring allelic variant of a polypeptide consisting of the amino acid
sequence of FATP4, wherein said polynucleotide hybridizes to a complement of
a polynucleotide consisting of SEQ ID NO:52 under high stringency conditions,
comprising the steps of isolating the protein, contacting the agent with the
isolated protein under conditions appropriate for binding of the agent to the
isolated protein, and detecting a resulting agent-protein complex.

93. The method of Claim 92 wherein the step of contacting the agent with the 
isolated protein is performed in an artificial membrane system. 

94. The method of Claim 92 wherein the isolated protein is in isolated plasma 
membrane.

95. A method for identifying an agent which inhibits interaction between (1) an 
isolated protein, said protein being encoded by (i) a polynucleotide comprising a 
nucleotide sequence which encodes a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of FATP2, wherein said 

polynucleotide hybridizes to a complement of a polynucleotide consisting of
SEQ ID NO:48 under high stringency conditions, or by (ii) a polynucleotide
having a nucleotide sequence which encodes a naturally occurring allelic variant
of a polypeptide consisting of the amino acid sequence of FATP4, wherein said
polynucleotide hybridizes to a complement of a polynucleotide consisting of
SEQ ID NO:52 under high stringency conditions and (2) a ligand of said protein,
comprising:

(a) combining:

(1) said isolated protein;

(2) the ligand of said protein; and

(3) a candidate agent to be assessed for its ability to inhibit
interaction between said protein of (1) and the ligand of (2),
under conditions appropriate for interaction between said protein
of (1) and the ligand of (2);

(b) determining the extent to which said protein of (1) and the ligand of (2)
interact; and

(c) comparing the extent determined in (b) with the extent to which
interaction of said protein of (1) and the ligand of (2) occurs in the
absence of the candidate agent to be assessed and under the same
conditions appropriate for interaction of said protein of (1) with the
ligand of (2);

wherein if the extent to which interaction of said protein of (1) and the ligand of
(2) occurs is less in the presence of the candidate agent than in the absence of the
candidate agent, the candidate agent is an agent which inhibits interaction
between said protein and the ligand of said protein.

96. The method of Claim 95 wherein (a) is performed in an artificial membrane 
system. 

97. The method of Claim 95 wherein said isolated protein is in isolated plasma 
membrane. 

98. A method for identifying an agent which binds to a protein encoded by a nucleic
acid encoding a fatty acid transport protein comprising an amino acid sequence
sharing at least about 95% amino acid sequence similarity with the amino acid
sequence in SEQ ID NO:49, or SEQ ID NO:53 comprising the steps of isolating
the protein, contacting the agent with the isolated protein under conditions
appropriate for binding of the agent to the isolated protein, and detecting a
resulting agent-protein complex.

99. The method of Claim 98 wherein the step of contacting the agent with isolated 
protein is performed in an artificial membrane system. 

100. The method of Claim 98 wherein the isolated protein is in isolated plasma
membrane. 

101. A method for identifying an agent which inhibits interaction between (i) an
isolated protein encoded by a nucleic acid encoding a fatty acid transport protein
comprising an amino acid sequence sharing at least about 90% amino acid
sequence similarity with the amino acid sequence in SEQ ID NO:49, or (ii) a
protein encoded by a nucleic acid encoding a fatty acid transport protein
comprising an amino acid sequence sharing at least about 90% amino acid
sequence similarity with the amino acid sequence in SEQ ID NO:53 and a ligand
of said protein, said method comprising:

(a) combining:

(1) said isolated protein;

(2) the ligand of said protein; and

(3) a candidate agent to be assessed for its ability to inhibit
interaction between said protein of (1) and the ligand of (2),
under conditions appropriate for interaction between the said
protein of (1) and the ligand of (2);

(b) determining the extent to which said protein of (1) and the ligand of (2)
interact; and

(c) comparing the extent determined in (b) with the extent to which
interaction of said protein of (1) and the ligand of (2) occurs in the
absence of the candidate agent to be assessed and under the same
conditions appropriate for interaction of said protein of (1) with the
ligand of (2);

wherein if the extent to which interaction of said protein of (1) and the ligand of
(2) occurs is less in the presence of the candidate agent than in the absence of the
candidate agent, the candidate agent is an agent which inhibits interaction
between said protein and the ligand of said protein.

102. The method of Claim 101 wherein (a) is performed in an artificial membrane 
system. 

103. The method of Claim 101 wherein said isolated protein is in isolated plasma
membrane. 

104. A method for identifying an agent which is an inhibitor of fatty acid uptake by
(i) a protein encoded by a polynucleotide comprising a nucleotide sequence
which encodes a protein consisting of the amino acid sequence in SEQ ID
NO:49, or by (ii) a protein encoded by a polynucleotide comprising a nucleotide
sequence which encodes a protein consisting of the amino acid sequence in SEQ
ID NO:53, comprising the steps of:

a) maintaining test cells expressing said polynucleotide in the presence of a
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake;

b) measuring uptake of the fatty acid in the test cells; and

c) comparing uptake of the fatty acid in the test cells with uptake of the
fatty acid in suitable control cells;

wherein lower uptake of the fatty acid in the test cells compared to uptake of the
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty
acid uptake by said protein.

105. An inhibitor of fatty acid uptake identified by the method of Claim 104.

106. The method of Claim 104 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals;

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein.

107. An inhibitor of fatty acid uptake identified by the method of Claim 106.



108. A method for identifying an agent which is an inhibitor of fatty acid uptake by a
protein, said protein encoded by (i) a polynucleotide comprising a nucleotide
sequence which encodes a naturally occurring allelic variant of a polypeptide
consisting of the amino acid sequence of FATP2, wherein said polynucleotide
hybridizes to a complement of a polynucleotide consisting of SEQ ID NO:48
under high stringency conditions, or by (ii) a polynucleotide comprising a
nucleotide sequence which encodes a naturally occurring allelic variant of a
polypeptide consisting of the amino acid sequence of FATP4, wherein said
polynucleotide hybridizes to a complement of a polynucleotide consisting of
SEQ ID NO:52 under high stringency conditions, comprising the steps of:

a) maintaining test cells expressing said polynucleotide in the presence of a
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake;

b) measuring uptake of the fatty acid in the test cells; and

c) comparing uptake of the fatty acid in the test cells with uptake of the
fatty acid in suitable control cells;

wherein lower uptake of the fatty acid in the test cells compared to uptake of the
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty
acid uptake by said protein.
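
Steps b) and c) of Claim 108 compare fatty acid uptake in test cells with uptake in control cells. A minimal Python sketch of that comparison follows. It assumes, purely for illustration, that uptake is recorded as replicate numeric measurements and that the replicates are summarized by their mean; the function name and the sample values are hypothetical, and the claim does not prescribe any particular summary or statistical test.

```python
# Illustrative sketch only; names, replicate design, and values are assumptions.
from statistics import mean
from typing import Sequence


def agent_lowers_uptake(test_uptake: Sequence[float],
                        control_uptake: Sequence[float]) -> bool:
    """Return True if mean uptake in test cells (agent present) is lower
    than mean uptake in the control cells."""
    if not test_uptake or not control_uptake:
        raise ValueError("both groups need at least one measurement")
    return mean(test_uptake) < mean(control_uptake)


# Made-up replicate measurements in arbitrary units.
test_cells = [10.0, 12.0, 11.0]
control_cells = [20.0, 22.0, 21.0]
print(agent_lowers_uptake(test_cells, control_cells))
```

In practice a comparison of this kind would usually be paired with a significance test across replicates; that step is outside what the claim recites and is omitted here.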

109. An inhibitor of fatty acid uptake identified by the method of Claim 108.

110. The method of Claim 108 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals;

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein.



111. An inhibitor of fatty acid uptake identified by the method of Claim 110.

112. A method for identifying an agent which is an inhibitor of fatty acid uptake by a
protein, said protein being encoded by (i) a nucleic acid encoding a fatty acid 
transport protein comprising an amino acid sequence sharing at least about 95% 
amino acid sequence similarity with the amino acid sequence in SEQ ID NO:49 
or by (ii) a nucleic acid encoding a fatty acid transport protein comprising an 
amino acid sequence sharing at least about 95% amino acid sequence similarity 
with the amino acid sequence in SEQ ID NO:53, comprising the steps of:

a) maintaining test cells expressing said polynucleotide in the presence of a
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
fatty acid in suitable control cells;

wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 

113. An inhibitor of fatty acid uptake identified by the method of Claim 112. 

114. The method of Claim 112 further comprising the steps of:

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals;

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

115. An inhibitor of fatty acid uptake identified by the method of Claim 114. 

116. A method for identifying an agent which is an inhibitor of (i) a protein encoded
by a polynucleotide comprising a nucleotide sequence which encodes a protein 
comprising the amino acid sequence in SEQ ID NO:49 or (ii) a protein encoded 
by a polynucleotide comprising a nucleotide sequence which encodes a protein 
comprising the amino acid sequence in SEQ ID NO:53, comprising the steps of: 



(a) introducing into host cells one or more vectors comprising a
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of

said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 

substrate of the host cells; 
wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein.



117. The method of Claim 116 further comprising the steps of:

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable

samples of tissue or bodily fluid from suitable control animals; and 
d) comparing the fatty acids of b) with the fatty acids of c); 
whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

118. A method for identifying an agent which is an inhibitor of a protein, said protein
being encoded by (i) a polynucleotide comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of FATP2, wherein said polynucleotide hybridizes to a 
complement of a polynucleotide consisting of SEQ ID NO:48 under high 

stringency conditions, or by (ii) a polynucleotide comprising a nucleotide

sequence which encodes a naturally occurring allelic variant of a polypeptide
consisting of the amino acid sequence of FATP4, wherein said polynucleotide
hybridizes to a complement of a polynucleotide consisting of SEQ ID NO:52 
under high stringency conditions, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein;

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein;

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid

substrate of the host cells; 
wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein. 

119. The method of Claim 118 further comprising the steps of:

a) administering the agent to one or more test animals;

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c);

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

120. A method for identifying an agent which is an inhibitor of a protein, said protein 
being encoded by (i) a nucleic acid encoding a fatty acid transport protein 
comprising an amino acid sequence sharing at least about 95% amino acid

sequence similarity with the amino acid sequence in SEQ ID NO:49 or by (ii) a
nucleic acid encoding a fatty acid transport protein comprising an amino acid
sequence sharing at least about 95% amino acid sequence similarity with the 
amino acid sequence in SEQ ID NO:53, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
second aliquot is indicative that the agent is an inhibitor of said protein. 

121. The method of Claim 120 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c);

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

122. A method for identifying an agent which binds to a protein comprising an amino 
acid sequence of SEQ ID NO:57, comprising the steps of contacting the agent 
with the isolated protein under conditions appropriate for binding of the agent to 
the isolated protein, and detecting a resulting agent-protein complex.

123. The method of Claim 122 wherein the step of contacting the agent with isolated
protein is performed in an artificial membrane system. 

124. The method of Claim 122 wherein the isolated protein is in isolated plasma 
membrane. 

125. A method for identifying an agent which inhibits interaction between an isolated
protein comprising an amino acid sequence of SEQ ID NO:57, and further 
comprising a ligand of said protein, comprising: 
(a) combining: 

(1) said isolated protein; 
(2) the ligand of said protein; and

(3) a candidate agent to be assessed for its ability to inhibit 

interaction between said protein of (1) and the ligand of (2), 
under conditions appropriate for interaction between the said
protein of (1) and the ligand of (2); 
(b) determining the extent to which said protein of (1) and the ligand of (2)

interact; and 

(c) comparing the extent determined in (b) with the extent to which 

interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under the same 
conditions appropriate for interaction of said protein of (1) with the

ligand of (2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein.

126. The method of Claim 125 wherein (a) is performed in an artificial membrane
system. 

127. The method of Claim 125 wherein said isolated protein is in isolated plasma 
membrane. 

128. A method for identifying an agent which binds to a protein, said protein encoded
by a polynucleotide comprising a nucleotide sequence which encodes a naturally 
occurring allelic variant of a polypeptide consisting of the amino acid sequence 
of FATP6, wherein said polynucleotide hybridizes to a complement of a 
polynucleotide consisting of SEQ ID NO:56 under high stringency conditions, 
comprising the steps of isolating the protein, contacting the agent with the

isolated protein under conditions appropriate for binding of the agent to the 
isolated protein, and detecting a resulting agent-protein complex. 

129. The method of Claim 128 wherein the step of contacting the agent with the 
isolated protein is performed in an artificial membrane system. 

130. The method of Claim 128 wherein the isolated protein is in isolated plasma
membrane. 

131. A method for identifying an agent which inhibits interaction between (1) an
isolated protein, said protein encoded by a polynucleotide comprising a 
nucleotide sequence which encodes a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of FATP6, wherein said

polynucleotide hybridizes to a complement of a polynucleotide consisting of 
SEQ ID NO:56 under high stringency conditions, and (2) a ligand of said 
protein, comprising:

(a) combining:

(1) said isolated protein; 

(2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 

interaction between said protein of (1) and the ligand of (2),

under conditions appropriate for interaction between said protein 
of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

(c) comparing the extent determined in (b) with the extent to which

interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under the same 
conditions appropriate for interaction of said protein of (1) with the 
ligand of (2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of

(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

132. The method of Claim 131 wherein (a) is performed in an artificial membrane 
system.

133. The method of Claim 131 wherein the isolated protein is in isolated plasma 
membrane. 



134. A method for identifying an agent which binds to a protein encoded by a nucleic
acid encoding a fatty acid transport protein consisting of an amino acid sequence 
sharing at least about 95% amino acid sequence similarity with the amino acid
sequence in SEQ ID NO:57 comprising the steps of isolating the protein,
contacting the agent with the isolated protein under conditions appropriate for
binding of the agent to the isolated protein, and detecting a resulting
agent-protein complex.

135. The method of Claim 134 wherein the step of contacting the agent with isolated
protein is performed in an artificial membrane system. 

136. The method of Claim 134 wherein the isolated protein is in isolated plasma 
membrane. 

137. A method for identifying an agent which inhibits interaction between an isolated 
protein encoded by a nucleic acid encoding a fatty acid transport protein

comprising an amino acid sequence sharing at least about 90% amino acid 
sequence similarity with the amino acid sequence in SEQ ID NO:57 and a ligand 
of said protein, said method comprising: 

(a) combining: 

(1) said isolated protein;

(2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 
interaction between said protein of (1) and the ligand of (2), 
under conditions appropriate for interaction between the said 

protein of (1) and the ligand of (2);

(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

(c) comparing the extent determined in (b) with the extent to which 
interaction of said protein of (1) and the ligand of (2) occurs in the 

absence of the candidate agent to be assessed and under the same
conditions appropriate for interaction of said protein of (1) with the
ligand of (2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction

between said protein and the ligand of said protein. 

138. The method of Claim 137 wherein (a) is performed in an artificial membrane 
system. 

139. The method of Claim 137 wherein said isolated protein is in isolated plasma 
membrane.

140. A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein encoded by a polynucleotide comprising a nucleotide sequence which 
encodes a protein consisting of the amino acid sequence in SEQ ID NO:57, 
comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a

fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
fatty acid in suitable control cells; 

wherein lower uptake of the fatty acid in the test cells compared to uptake of the

fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 



141. An inhibitor of fatty acid uptake identified by the method of Claim 140.

142. The method of Claim 140 further comprising the steps of:

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable

samples of tissue or bodily fluid from suitable control animals; 
d) comparing the fatty acids of b) with the fatty acids of c); 
whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

143. An inhibitor of fatty acid uptake identified by the method of Claim 142.

144. A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein encoded by a polynucleotide comprising a nucleotide 
sequence which encodes a naturally occurring allelic variant of a polypeptide 
consisting of the amino acid sequence of FATP6, wherein said polynucleotide 
hybridizes to a complement of a polynucleotide consisting of SEQ ID NO:56

under high stringency conditions, comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the

fatty acid in suitable control cells; 
wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 



145. An inhibitor of fatty acid uptake identified by the method of Claim 144.

146. The method of Claim 144 further comprising the steps of:

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

147. An inhibitor of fatty acid uptake identified by the method of Claim 146.

148. A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein being encoded by a nucleic acid encoding a fatty acid 
transport protein comprising an amino acid sequence sharing at least about 95% 
amino acid sequence similarity with the amino acid sequence in SEQ ID NO:57, 
comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
fatty acid in suitable control cells;

wherein lower uptake of the fatty acid in the test cells compared to uptake of the
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 

149. An inhibitor of fatty acid uptake identified by the method of Claim 148.

150. The method of Claim 148 further comprising the steps of:

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable

samples of tissue or bodily fluid from suitable control animals; 
d) comparing the fatty acids of b) with the fatty acids of c); 
whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

151. An inhibitor of fatty acid uptake identified by the method of Claim 150.

152. A method for identifying an agent which is an inhibitor of a protein encoded by
a polynucleotide comprising a nucleotide sequence which encodes a protein 
comprising the amino acid sequence in SEQ ID NO:57, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein;

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid

substrate of the host cells; 
wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein. 



153. The method of Claim 152 further comprising the steps of:

a) administering the agent to one or more test animals;

b) measuring exogenously supplied fatty acids in one or more samples of
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c);

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

154. A method for identifying an agent which is an inhibitor of a protein, said protein
being encoded by a polynucleotide comprising a nucleotide sequence which
encodes a naturally occurring allelic variant of a polypeptide consisting of the
amino acid sequence of FATP6, wherein said polynucleotide hybridizes to a
complement of a polynucleotide consisting of SEQ ID NO:56 under high
stringency conditions, comprising the steps of:

(a) introducing into host cells one or more vectors comprising a
polynucleotide expressing said protein;

(b) culturing a first aliquot of the host cells with fatty acid substrate of said
protein and with an agent being tested as an inhibitor of said protein;

(c) culturing a second aliquot of the host cells with fatty acid substrate of
said protein;

(d) measuring, in the first and second aliquots, uptake of the fatty acid
substrate of the host cells;

wherein less uptake of the fatty acid substrate in the first aliquot compared to
the second aliquot is indicative that the agent is an inhibitor of said protein.

155. The method of Claim 154 further comprising the steps of:

a) administering the agent to one or more test animals;

b) measuring exogenously supplied fatty acids in one or more samples of
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c);

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

156. A method for identifying an agent which is an inhibitor of a protein, said protein
being encoded by a nucleic acid encoding a fatty acid transport protein 
comprising an amino acid sequence sharing at least about 95% amino acid 
sequence similarity with the amino acid sequence in SEQ ID NO:57, comprising 
the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
second aliquot is indicative that the agent is an inhibitor of said protein. 

157. The method of Claim 156 further comprising the steps of:

a) administering the agent to one or more test animals;

b) measuring exogenously supplied fatty acids in one or more samples of
tissue or bodily fluid from suitable control animals;

c) measuring exogenously supplied fatty acids in one or more comparable
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c);

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein.

158. A method for identifying an agent which is an inhibitor of a fatty acid transport 
protein, comprising the steps of: 

(a) introducing into cells one or more vectors comprising a gene encoding a 
cell surface protein and a nucleic acid encoding the fatty acid transport 

protein;

(b) contacting the host cells with anti-cell surface protein antibody and 
labeled fatty acid substrate of the fatty acid transport protein; 

(c) contacting a first aliquot of the host cells with an agent being tested as an 
inhibitor of the fatty acid transport protein, while leaving a second 

aliquot of the host cells uncontacted with the agent;

(d) identifying, in the first and second aliquots, the host cells expressing the 
cell surface protein by detecting the anti-cell surface protein antibody 
bound to the host cells; and 

(e) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells identified as expressing the cell surface

protein; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
second aliquot is indicative that the agent is an inhibitor of the fatty acid 
transport protein. 



159. The method of Claim 158 wherein the host cells regulably express the FATP4
gene.

160. The method of Claim 158 wherein the host cells are prokaryotes.

161. The method of Claim 158 wherein the prokaryotes are E. coli. 

162. The method of Claim 158 wherein the fatty acid is a radioactively labeled fatty 
acid. 



163. A method for identifying an agent which is an inhibitor of FATP4, comprising
the steps of: 

(a) introducing into cells one or more vectors comprising a gene encoding a 
cell surface protein and a nucleic acid encoding FATP4; 

(b) contacting the host cells with anti-cell surface protein antibody and 
labeled fatty acid substrate of FATP4;

(c) contacting a first aliquot of the host cells with an agent being tested as an 
inhibitor of FATP4, while leaving a second aliquot of the host cells 
uncontacted with the agent; 

(d) identifying, in the first and second aliquots, the host cells expressing the 
cell surface protein by detecting the anti-cell surface protein antibody

bound to the host cells; and 

(e) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells identified as expressing the cell surface 
protein; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the

second aliquot is indicative that the agent is an inhibitor of FATP4. 



164. The method of Claim 163 further comprising the steps of: 
a) administering the agent to one or more test animals;

b) measuring exogenously supplied fatty acids in one or more samples of
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c);

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 



165. The method of Claim 163 wherein the cell surface protein is CD2. 

166. The method of Claim 163 wherein the fatty acid substrate is BODIPY-labeled. 

167. A method for detecting, in a sample of cells, a nucleic acid molecule comprising
at least about 90% sequence similarity to SEQ ID NO:48, comprising: 

a) purifying nucleic acid from the cells; 

b) hybridizing 1) purified nucleic acid from the cells to 2) purified nucleic 
acid comprising SEQ ID NO:48, under conditions that allow 

hybridization between 1) and 2) if the sequences of 1) and 2) have at

least about 90% sequence similarity; and 

c) detecting resulting hybrid nucleic acids in the hybridization; wherein, if 
hybrid nucleic acids are detected at a significant level compared to a 
suitable control hybridization, then a nucleic acid molecule comprising at 

least about 90% sequence similarity to SEQ ID NO:48, has been

detected. 



168. A method for detecting, in a sample of purified nucleic acid, a nucleic acid
molecule having at least about 90% sequence similarity to SEQ ID NO:48,
comprising:

a) hybridizing 1) the sample of purified nucleic acid to 2) purified nucleic
acid comprising SEQ ID NO:48, under conditions that allow 
hybridization between 1) and 2) if the sequences of 1) and 2) have at 
least about 90% sequence similarity; and 
b) detecting resulting hybrid nucleic acids in the hybridization; wherein, if

hybrid nucleic acids are detected at a significant level compared to a 
suitable control hybridization, then a nucleic acid molecule comprising at 
least about 90% sequence similarity to SEQ ID NO:48, has been 
detected. 



169. A method for identifying (1) nucleic acid molecules in fixed cells which

specifically interact with a (2) nucleic acid molecule comprising the nucleotide 
sequence in SEQ ID NO:48, said method comprising the steps of: 
a) adding to the fixed cells the nucleic acid molecule comprising a
nucleotide sequence in SEQ ID NO:48; 
b) incubating the fixed cells under conditions allowing hybridization of (1)

with (2); 

c) removing the nucleic acid molecule of step a) that has not hybridized; 
and 

d) detecting hybrid molecules comprising (1) and (2). 



170. A method for detecting FATP2 in a sample of cells, comprising the steps of
adding an agent that specifically binds to FATP2 to the sample, and detecting 
agent specifically bound to the FATP2. 



171. The method of Claim 170 wherein the agent is an antibody which binds to
FATP2.

172. A method for detecting FATP2 in a sample of cell lysate, comprising the steps
of adding an agent that specifically binds to FATP2 to the sample, and detecting 
agent specifically bound to the FATP2. 

173. The method of Claim 172 wherein the agent is an antibody which binds to 
FATP2.

174. An isolated antibody which binds to a polypeptide having an amino acid
sequence sharing at least about 95% amino acid sequence similarity with the 
amino acid sequence in SEQ ID NO:49. 

175. An isolated antibody which binds to a fatty acid transport protein having the 
amino acid sequence in SEQ ID NO:49.

176. A method for detecting, in a sample of cells, a nucleic acid molecule comprising
at least about 90% sequence similarity to SEQ ID NO:52, comprising: 

a) purifying nucleic acid from the cells; 

b) hybridizing 1) purified nucleic acid from the cells to 2) purified nucleic 
acid comprising SEQ ID NO:52, under conditions that allow

hybridization between 1) and 2) if the sequences of 1) and 2) have at 
least about 90% sequence similarity; and 

c) detecting resulting hybrid nucleic acids in the hybridization; wherein, if 
hybrid nucleic acids are detected at a significant level compared to a 

suitable control hybridization, then a nucleic acid molecule having at

least about 90% sequence similarity to SEQ ID NO:52, has been 
detected. 
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177. A method for detecting, in a sample of purified nucleic acid, a nucleic acid 
molecule comprising at least about 90% sequence similarity to SEQ ID NO:52, 
comprising: 

a) hybridizing 1) the sample of purified nucleic acid to 2) purified nucleic 
5 acid comprising SEQ ID NO:52, under conditions that allow 

hybridization between 1) and 2) if the sequences of 1) and 2) have at 
least about 90% sequence similarity; and 

b) detecting resulting hybrid nucleic acids in the hybridization; wherein, if 
hybrid nucleic acids are detected at a significant level compared to a 

1 0 suitable control hybridization, then a nucleic acid molecule having at 

least about 90% sequence similarity to SEQ ID NO: 52, has been 
detected. 
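
Claims 176 and 177 frame detection in terms of hybridization under conditions that require at least about 90% sequence similarity. As a computational analogue only, and not a procedure the claims recite, the sketch below scores two pre-aligned sequences of equal length by percent identity and compares the result with a 90% threshold. The function names, the gap character `-`, and the choice of identity as the similarity measure are all assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences carry a gap ('-') are skipped; a gap
    opposite a residue counts as a mismatch. (Illustrative convention.)
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # gap in both sequences: not an informative column
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_similarity_threshold(aligned_a: str, aligned_b: str,
                               threshold: float = 90.0) -> bool:
    """True if percent identity is at or above the threshold (default 90%)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, `meets_similarity_threshold("ACGTACGTAC", "ACGTACGTAA")` compares ten positions with one mismatch, which sits exactly at the 90% boundary. A real comparison would first need an alignment step, which this sketch leaves out.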

178. A method for identifying (1) nucleic acid molecules in fixed cells which
specifically interact with a (2) nucleic acid molecule comprising the nucleotide
sequence in SEQ ID NO:52, said method comprising the steps of:

a) adding to the fixed cells the (2) nucleic acid molecule comprising a
nucleotide sequence in SEQ ID NO:52;

b) incubating the fixed cells under conditions allowing hybridization of (1)
with (2);

c) removing the nucleic acid molecule of step a) that has not hybridized;
and

d) detecting hybrid molecules comprising (1) and (2).



179. A method for detecting FATP4 in a sample of cells, comprising the steps of
adding an agent that specifically binds to FATP4 to the sample, and detecting
agent specifically bound to the FATP4.

180. The method of Claim 179 wherein the agent is an antibody which binds to
FATP4.

181. A method for detecting FATP4 in a sample of cell lysate, comprising the steps
of adding an agent that specifically binds to FATP4 to the sample, and detecting
agent specifically bound to the FATP4.

182. The method of Claim 181 wherein the agent is an antibody which binds to
FATP4.

183. An isolated antibody which binds to a polypeptide having an amino acid
sequence sharing at least about 95% amino acid sequence similarity with the
amino acid sequence in SEQ ID NO:53.

184. An isolated antibody which binds to a fatty acid transport protein having the
amino acid sequence in SEQ ID NO:53.

185. A method for detecting, in a sample of cells, a nucleic acid molecule comprising
at least about 90% sequence similarity to SEQ ID NO:56, comprising:

a) purifying nucleic acid from the cells;

b) hybridizing 1) purified nucleic acid from the cells to 2) purified nucleic
acid comprising SEQ ID NO:56, under conditions that allow
hybridization between 1) and 2) if the sequences of 1) and 2) have at
least about 90% sequence similarity; and

c) detecting resulting hybrid nucleic acids in the hybridization; wherein, if
hybrid nucleic acids are detected at a significant level compared to a
suitable control hybridization, then a nucleic acid molecule having at
least about 90% sequence similarity to SEQ ID NO:56, has been
detected.

186. A method for detecting, in a sample of purified nucleic acid, a nucleic acid
molecule comprising at least about 90% sequence similarity to SEQ ID NO:56,
comprising:

a) hybridizing 1) the sample of purified nucleic acid to 2) purified nucleic
acid comprising SEQ ID NO:56 under conditions that allow
hybridization between 1) and 2) if the sequences of 1) and 2) have at
least about 90% sequence similarity; and

b) detecting resulting hybrid nucleic acids in the hybridization; wherein, if
hybrid nucleic acids are detected at a significant level compared to a
suitable control hybridization, then a nucleic acid molecule comprising at
least about 90% sequence similarity to SEQ ID NO:56 has been detected.



187. A method for identifying (1) nucleic acid molecules in fixed cells which
specifically interact with a (2) nucleic acid molecule having the nucleotide
sequence in SEQ ID NO:56, said method comprising the steps of:

a) adding to the fixed cells the (2) nucleic acid molecule comprising the
nucleotide sequence in SEQ ID NO:56;

b) incubating the fixed cells under conditions allowing hybridization of (1)
with (2);

c) removing the nucleic acid molecule of step a) that has not hybridized;
and

d) detecting hybrid molecules comprising (1) and (2).

188. A method for detecting FATP6 in a sample of cells, comprising the steps of
adding an agent that specifically binds to FATP6 to the sample, and detecting
agent specifically bound to the FATP6.

189. The method of Claim 188 wherein the agent is an antibody which binds to
FATP6.

190. A method for detecting FATP6 in a sample of cell lysate, comprising the steps
of adding an agent that specifically binds to FATP6 to the sample, and detecting
agent specifically bound to the FATP6.

191. The method of Claim 190 wherein the agent is an antibody which binds to
FATP6.

192. An isolated antibody which binds to a polypeptide having an amino acid
sequence sharing at least about 95% amino acid sequence similarity with the
amino acid sequence in SEQ ID NO:57.

193. An isolated antibody which binds to a fatty acid transport protein having the
amino acid sequence in SEQ ID NO:57.

194. A method for modulating fatty acid uptake of cells in culture, comprising adding
one or more agents that modulate fatty acid uptake to cells comprising one or
more fatty acid transport proteins.

195. The method of Claim 194 wherein the agent modulates fatty acid uptake by
modulating biosynthesis of one or more fatty acid transport proteins.

196. The method of Claim 195 wherein the agent modulates fatty acid uptake by
modulating biosynthesis of FATP6.

197. The method of Claim 196 wherein the agent is an antisense oligonucleotide. 
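
An antisense oligonucleotide is complementary to a stretch of its target transcript. A minimal sketch of deriving such a sequence in silico, assuming a DNA oligonucleotide is wanted and that the target is written 5' to 3'. The function name is hypothetical, and the short target in the usage note is made up for illustration; it is not taken from any SEQ ID NO.

```python
# Translation table: each base maps to its DNA complement (U pairs like T).
_COMPLEMENT = str.maketrans("ACGTUacgtu", "TGCAAtgcaa")
_ALLOWED = set("ACGTUacgtu")


def antisense_dna(target: str) -> str:
    """Return the DNA reverse complement of a DNA or RNA target (5'->3')."""
    unexpected = set(target) - _ALLOWED
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    # Complement every base, then reverse so the result also reads 5'->3'.
    return target.translate(_COMPLEMENT)[::-1]
```

Calling `antisense_dna("AUGGCC")` on that made-up six-base RNA fragment yields `GGCCAT`. Oligonucleotide length, chemistry, and target-site selection are outside what this sketch covers.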

198. A method for inhibiting fatty acid uptake in the small intestine of a mammal,
comprising administering to the mammal a therapeutically effective amount of
an agent which is an inhibitor of fatty acid uptake by a fatty acid transport
protein in the small intestine of the mammal.

199. The method of Claim 198 wherein the agent is administered orally.

200. The method of Claim 198 wherein the fatty acid transport protein is hsFATP6.

201. A method for inhibiting fatty acid uptake in cardiac muscle of a human
comprising administering to the human a therapeutically effective amount of an
agent which is an inhibitor of fatty acid uptake by FATP6.

202. A method for directing an agent to cardiac muscle in a mammal, comprising
administering to the mammal a complex which comprises the substance and a
moiety which binds to FATP6.

203. A method for directing an agent to liver in a mammal, comprising administering
to the mammal a complex which comprises the substance and a moiety which
binds to FATP5.

204. A method for detecting a variant allele of a human FATP gene, comprising:

a) preparing amplified, purified reference DNA encoding all or a portion of
a FATP from a human, and amplified, purified test DNA encoding all or
a portion of the FATP from a human to be tested as having a variant
allele;

b) determining whether the reference DNA and test DNA differ in DNA
sequence;

wherein, if the test DNA differs in sequence from the reference DNA, the test
DNA comprises a variant allele of a human FATP gene.
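
Step b) of Claim 204 comes down to asking whether two sequences differ. Once both amplified DNAs have been sequenced, that comparison could be sketched as below. This assumes the two reads cover the same region and are already in register (no insertions or deletions to align around); the function names are hypothetical and the claim does not prescribe any particular comparison method.

```python
from typing import List, Tuple


def find_differences(reference: str, test: str) -> Tuple[List[Tuple[int, str, str]], bool]:
    """Compare two in-register DNA sequences.

    Returns (differences, length_differs), where each difference is a
    (1-based position, reference base, test base) tuple over the shared length.
    """
    ref = reference.upper()
    tst = test.upper()
    differences = [
        (i + 1, r, t)
        for i, (r, t) in enumerate(zip(ref, tst))
        if r != t
    ]
    return differences, len(ref) != len(tst)


def is_variant(reference: str, test: str) -> bool:
    """True if the test sequence differs from the reference in any way."""
    differences, length_differs = find_differences(reference, test)
    return bool(differences) or length_differs
```

With a made-up pair such as `find_differences("ATGCTG", "ATGTTG")`, the single substitution is reported at position 4. Sequences of unequal length are flagged through the second return value rather than aligned.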
[Drawing sheets, including 14/117 and 15/117: partial DNA and protein sequence listings with position numbers at the right margin. Panel titles, as far as they can be read: hsFATP2 protein sequence; hsFATP3 DNA sequence; hsFATP3 protein sequence; hsFATP4 DNA sequence; hsFATP4 protein sequence; hsFATP5 DNA sequence; hsFATP5 protein sequence; hsFATP6 DNA sequence; hsFATP6 protein sequence; mtFATP DNA sequence; mtFATP protein sequence. The sequence text is not recoverable from the scan.]
hsFATP1

[Figure: hsFATP1 nucleotide sequence printed as codon triplets, with nucleotide positions in the left margin and the deduced amino acid sequence in one-letter code beneath the coding region. The sequence text is not recoverable from the scan.]
hsFATP4

[Figure: hsFATP4 nucleotide sequence printed as codon triplets, with nucleotide positions in the left margin (1 through 2881) and the deduced amino acid sequence in one-letter code beneath the coding region. The sequence text is not recoverable from the scan.]
Protein sequence 646 a.a.

[Figure: hydrophobicity plot; horizontal axis residue position (100 to 600), vertical axis score (-3 to 3). The curve is not recoverable from the scan.]

Amino acid usage, protein sequence 646 a.a. (646 Amino Acids; MW: [illegible] Dalton)
Values are as read from the scan; [illegible] marks an entry that could not be read.

Residue                  n     n(%)          MW     MW(%)
A ala alanine            64    9.9           4541   6.4
C cys cysteine           15    2.3           1545   2.2
D asp aspartic acid      30    [illegible]   3450   4.9
E glu glutamic acid      31    4.9           4000   5.6
F phe phenylalanine      29    4.5           4284   3.0
G gly glycine            63    9.8           3592   5.1
H his histidine          13    2.0           1781   2.5
I ile isoleucine         29    4.5           3279   4.6
K lys lysine             22    3.4           2810   4.0
L leu leucine            77    11.9          8707   12.3
M met methionine         11    1.7           1441   2.0
N asn asparagine         15    2.3           1710   2.1
P pro proline            39    4.5           2014   4.0
Q gln glutamine          25    3.9           3201   4.3
R arg arginine           49    7.6           7641   10.8
S ser serine             33    5.1           2872   4.0
T thr threonine          27    4.2           2728   2.1
V val valine             51    7.9           5052   7.1
W trp tryptophan         9     1.4           1671   2.4
X unk unknown
Y tyr tyrosine           24    3.7           3913   [illegible]
STOP
hsFATP4.pep -> KD Hydrophobicity <11/1>
Protein sequence 643 a.a. MLLGASLVGVLL ... AYSRIQAGEEKL

[Figure: hydrophobicity plot; horizontal axis residue position (100 to 600), vertical axis score (-3 to 3). The curve is not recoverable from the scan.]
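
A hydrophobicity profile of this kind is normally a sliding-window average of per-residue hydropathy values. The sketch below assumes that "KD" stands for the Kyte-Doolittle scale and that "<11/1>" denotes a window of 11 residues advanced one residue at a time; neither reading is confirmed by the figure itself. The function name is hypothetical.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 11) -> list:
    """Mean hydropathy of every full window, sliding one residue at a time."""
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in seq]  # KeyError on non-standard residues
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]
```

Applied to the 12 residues quoted in the figure header, `hydropathy_profile("MLLGASLVGVLL")` returns two window averages, one for residues 1 to 11 and one for residues 2 to 12. Positive values indicate hydrophobic stretches on this scale.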



hsFATP4.pep -> A.A. Usage
Protein sequence 643 a.a. MLLGASLVGVLL ... AYSRIQAGEEKL

643 Amino Acids; MW: 72018 Dalton
Values are as read from the scan; [illegible] marks an entry that could not be read, and a hyphen marks an entry that did not survive the scan.

Residue                  n     n(%)          MW            MW(%)
A ala alanine            46    7.2           [illegible]   4.5
C cys cysteine           16    -             1648          2.3
D asp aspartic acid      33    -             3795          5.3
E glu glutamic acid      33    -             4258          5.9
F phe phenylalanine      34    -             5000          6.9
G gly glycine            54    -             3079          4.3
H his histidine          12    -             1644          2.3
I ile isoleucine         30    -             3393          4.7
K lys lysine             31    -             3970          5.5
L leu leucine            76    [illegible]   6594          11.9
M met methionine         12    -             1572          2.2
N asn asparagine         21    -             2394          3.3
P pro proline            31    -             3008          4.2
Q gln glutamine          23    -             2945          4.1
R arg arginine           43    -             7024          9.8
S ser serine             35    -             3044          4.2
T thr threonine          32    -             3233          4.5
V val valine             46    -             4357          6.3
W trp tryptophan         8     -             1488          2.1
X unk unknown
Y tyr tyrosine           25    3.9           4076          5.7
STOP
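
The n and n(%) columns of an amino acid usage table are plain residue counts and their share of the sequence length. A minimal sketch of that tally, with a hypothetical function name; the MW columns are left out because they would require a residue-mass table that this document does not give.

```python
from collections import Counter


def amino_acid_usage(sequence: str) -> dict:
    """Map each residue letter to (count, percent of total residues)."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence is empty")
    counts = Counter(seq)
    total = len(seq)
    return {
        residue: (n, 100.0 * n / total)
        for residue, n in sorted(counts.items())
    }
```

For the 12 N-terminal residues quoted in the table header, `amino_acid_usage("MLLGASLVGVLL")` counts five leucines out of twelve residues. Running it on the full 643-residue sequence would be needed to reproduce the table.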
Alignment Report of [title partly illegible], using Clustal method with Weighted residue weight table.

Aligned sequences: hFATP1con.seq ORF; mFATP1.seq ORF (from genomic)

Decoration 'Decoration #1': Shade (with solid bright yellow) residues that match the consensus [remainder cut off in the scan].

Alignment Report of FATP4 DNA alignment, using Clustal method with Weighted residue weight table.

[Figure: aligned sequences not recoverable from the scan.]

Alignment Report of Untitled, using Clustal method with PAM250 residue weight table.

[Figure: alignment of two peptide sequences; sequence labels and residues not recoverable from the scan.]

Decoration 'Decoration #2': Shade (with solid bright yellow) residues that match the consensus named 'Consensus #1' exactly.
hsFATP6

[Figure: hsFATP6 nucleotide sequence printed as codon triplets, with nucleotide positions in the left margin and the deduced amino acid sequence in one-letter code beneath the coding region. The sequence text is not recoverable from the scan.]
[Figure: hydrophobicity plot for a protein sequence beginning MLL...; horizontal axis residue position (100 to 500), vertical axis score (-3 to 3). The curve is not recoverable from the scan.]

[Table: amino acid usage for the same sequence, with columns n, n(%), MW and MW(%) per residue; MW: 70056 Dalton. The scan lists each column as a separate run of numbers, so individual values cannot be reliably matched to residues.]
Decoration 'Decoration #1': Shade (with solid bright yellow) residues that match the Consensus exactly.

[Figure, sheet 36/117: alignment of hsFATP6pep and hsFATP1pep. The aligned residues are not recoverable from the scan.]
[Figure: uptake time course; horizontal axis in min.]

293 vector control: 23 pmol/(min*1*10^6 cells)
293 FATP4 clone 7: 73 pmol/(min*1*10^6 cells)

FIG. 32
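
The two rates in the figure legend can be compared directly as a ratio. A minimal sketch, assuming the legend values are read correctly as 23 and 73 pmol/(min*1*10^6 cells); the function name is hypothetical.

```python
def fold_increase(sample_rate: float, control_rate: float) -> float:
    """Ratio of a sample uptake rate to the control uptake rate."""
    if control_rate <= 0:
        raise ValueError("control rate must be positive")
    return sample_rate / control_rate


# Rates as read from the legend, in pmol/(min*1*10^6 cells).
VECTOR_CONTROL = 23.0
FATP4_CLONE_7 = 73.0

ratio = fold_increase(FATP4_CLONE_7, VECTOR_CONTROL)
print(f"FATP4 clone 7 vs. vector control: {ratio:.2f}-fold")
```

On those readings the FATP4 clone takes up fatty acid roughly three times as fast as the vector control (73/23 is about 3.17).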
[Figure: amino acid sequence alignment of hsFATP4, mmFATP4 and hsFATP1; only the row labels and start positions (1 through 612) survive in the scan, and the aligned residues are not recoverable]












Met uptake (cpm/min)
mmFATP4 DNA sequence

ATGCTCCTrGGAGCCTCTCraG'rGGGGGCGCTACTGTIVlCCAAGCTAGTG ^ 
eTCC L^TlTJ CTCCTGTAC l T IS GGQTCTGGTG^ 

TTGGTGG CATGGTGCTCCTGAA3GTGAAGACCAAGGTGCGACGGTACCTTCAGGAG CGGAAG ACGGTG C CCCTGCTGTTT 
GCTTCAATGGTACAGCGCCACCCGGACAAGACAGCCCTGATTTTCGAGG 
GGATGAGTACTCCAGTAGTGTGGCCAACTTCCTGCAGGCCCGGGGC 
AAAACCGCAATGAGTTTGTGGGTCTGTGGCTAGGCATGGCCAAGCTGGGCGTGGAGG 

AGGCGGGATGCCCTGCGC CACTGT CTTGACACCTCAAAGGCACGAGCTCTCATCTTICGCAGTGAGATGGCCTCAGCTAT 

CTGTGAGATCCATGCTAGCCTGGAGCCCAgACTCAGC^ 

GCACAGAGCATCTGGACCCTCTTCTGGAAGATGCCCCGAAGCACCTG^ 

CTCTTCTACATCTACACATCGGGCACCACGGGGCTACCCAAAG CTG CCATTGTGGTGCACAGCAGGTATTATCGTATGGC 
1TCCCTGGTGTACTATGGATTCCGCATGCGGCCTGATGACATTGTCTATGACTGC 

AACATCGTGGGGATTGGCAGTGCTTACTCCACGGCATGACTGTGGTGATCCGGAAGAAGTTCTCAGCCTCCCGGTTCTGG 

GATGATTGTATCAAGTACAACTGCACAGTGGTACAGTACATTGGCGAGCTCT 

TGAGGCTGAGTCTCGGCACAAGGTGCGCATGGCACTGGGCAACGGTCTCCGGCAGTCCATCT 

GTTTCCACATCCCCCAGGTGGCTGAGTTCTATGGGGCCACrrGAATGCAACTGTAGCCTGGGCAACTrTGA 

GGGGCCTGTGGCTTCAATAGCTOCATCCTGTCCTTTGTGTACCCTATCCGTTTGGTACGTGTCAATGAGGATACCATGGA 

ACTGATCCGGGGACCCGATGGAGTCTGCATTCCCTGTCAACCAGGTCAGCCAGGCCAGCTGGTGGGTCGC ATC ATCCA^ 

AGGACC CTCTGCGCCGTTTCGACGGGTACCTCAACCAGGGTGCCAACAACAAGAAGATTGCTAATGATGTCrT 

GGGGACCAAGCCTACCTCACTGGTGACGTCCTGGTGATGGATGAGCTGGGTTACCTGTACTTCCGAGATCGCACTGGGGA 

CACGTTCCGCTGGAAAGGGGAGAATGTATCTACCACTGAGCTGGAGG 

TGGCAGTTTATGGTGTTGAGGTGCCAGGAACTGAAGGCCGAG C AGGAATGGCTGCCGTTGCAAGTCCCATCAG CAACTGT 

GACCTGGAGAGCTTTGCACAGACCTTGAAAAAGGAGCTGCCrcrrGTATGCCCGCCCCATC^ 

GCTGCACAAGACAGGGACCTTCAAGTTCCAGAAGACAGAGTTGCGGAAGGAGGGCTTTGACCCATCTGTTGTGAA 

CGCTGTTCTATCTGGATGCTCGGAAGGGCTGCTACGTTGCACTGGACCAGGAGGCCTATACCCGCATCCAGGCAGGCGAG 

GAGAAGCTGTGATTTCCCCCTACATCCCTCTGAGGGCCAGAAGATGCTGGATTCAGAGCCCTAGCGTCCACCCCAGAGGG 

TCCTGGGCAATGCCAGACCAAAGCTAGCAGGGCCCGCACCTCCGCCCCTAGGTGCTGATCTCCCCTCTCCCAAACTGCCA 

AGTGACTCACTGCCGCTTCCCCGACCCTCCAGAGGCTTTCTGTGAAA^ 

TGCCCCCTCGCCCCAGGGTTTCTGATAGGCTC CrrTA G^ 

TCACTAAGATCCCTCCAATCAGAAGGGAGCTTACAAAGGAACCAAGGCAAAGCCTGTAGACT 

GAG ACTATAGTGGCCAGTCATCCCATGTCCACAG AGG ATCTTGGTCCAG AGCTG CCAAAGTGTCACCTCTCCCTG CCTGC 



ACCTCTGGGGAAAAGAGGAC AGCATGTGGC CACTGGG CACCTGTCTCAAGAAGTCAGGATC ACACACTCAGTCCTTGTTT 
CTCCAGGTTCCCTTGTTCTTGTCTCGGGGAGGGAGGGACGAGTGTCCT 

TTGCTTCTCCATCTGTCCTAGCCTGAGTGTGGGTGGAACAGGCATGAGGAGAGTGTGGCTCAGGGGCCAATAAACTCTGC 
CTTGACTC CTCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



mmFATP4 protein sequence 
MIiLGASLVGALLFSKLVIiKLPWTOVGFSLLLL^ 

RRDALJUICLDTSKARALIFGSEMASAICEIHASLEPTLSLFCSGSWEPS^ 
LPYIYTSGTTGLPKAAIVVHSRYYRMASLVYYGFllMRPDDIVYDCLPL^ 


EXL 
hsFATP1 full length.DNA



10 20 30 40 


TCGACCCACGGCGTCCGGGACCCCAAAGCAGAAGCCCGCA 40 
CAGTAGGCACAGCGCACCC AAGAAGGGTCCAGGAGTCTGC 80 
AGAAACAGAAAGGTCCCCGGCC TCAGCCTCCTAGICCCTG 120 
CCTGCCTCCTGCCTGAGC TTCTGGGAGACTGAAGGCACGG 160 
CTTGCAGCTTCAGGAT6CGGGC TCCGGGTGCGGGCGCGGC 200 

210 220 230 240 


CTCGGTGGTCTCGCTGGCGCTGTTGTGGCTGCTGGGGCTG 240 
CCGTGGACCTGGAGCGCGGCAGCGGCGCTCGGCGTGTACG 280 
TGGGCAGCGGCGGCTGGCGCTTCCTGCGCATCGTCTGCAA 320 
GACCGCGAGGCGAGACCTCTTCGGTCTCTCTGTGCTGATC 360 
CGCGTGCGCCTGGAGCTGCGGCGGCACCAGCGTGCCGGCC 400 

410 420 430 440 


ACACCATCCCGCGCATCTTTC AGGCGGTAGTGCAGCGACA 440 
GCCCGAGCGCCTGGCGCJGGTGGA7GCCGGGACCGGCGAG 480 
TGCTGGACCTTTGCGCAGCTGGACGCCTACTCCAATGCGG 520 
TAGCCAACCTCTTCCGCCAGCTGGGCTTCGCGCCGGGCGA 560 
CGTGGTGGCCATCTTCCTGGAGGGCCGGCCGGAGTTCGTG 600 

610 620 630 640 


GGGCTGTGGCTGGGCCTGGCCAAGGCGGGCATGGAGGCCG 640 
CGCTGCTCAACGTGAACCTGCGGCGCGAGCCCCTGGCCTT 680
CTGCCTGGGCACCTCGGGCGCTAAGGCCCTGATCTTTGGA 720 
GGAGAAATGGTGGCGGCGGTGGCCGAAGTGAGCGGGCATC 760 
TGGGGAAAAGTTTGATCAAGTTCTGC TCTGGAGACTTGGG 800 

810 820 830 840

GCCCGAGGGCATCTTGCCGGACACCCACCTCCTGGACCCG 840 
CTGCTGAAGGAGGCC 7CTACTGCCCCCTTGGCACAGATCC 880 
CCAGCAAGGGCATGGACGATCGTCTTTTCTACATCTACAC 920 
GTCGGGGACCACCGGGCTGCCCAAGGCTGCCATTGTCGTG 960 
CACAGCAGGTACTACCGCATGGCAGCCTTCGGCCACCACG 1000 

1010 1020 1030 1040 


CCTACCGCATGCAGGCGGC TGACGTGCTCTATGACTGCCT 1040 

GCCCCTGTACCACTCGGC AGGAAACATCATCGGCGTGGGG 1080 

CAGTGTCTCATCTATGGGCTGACAGTCGTCCTCCGCAAGA 1120

AATTCTCGGCCAGCCGCTTCTGGGACGACTGCATCAAGTA 1160

CAACTGCACGGTGGTTCAG TAC ATCGGGGAGATCTGCCGC 1200 
hsFATP1 full length.DNA



1210 1220 1230 1240 


TACCTGCTGAAGC AGCCGGTGCGCGAGGCGGAGAGGCGAC 1240 
ACCGCGTGCGCCTGGCGGTGGGGAACGGGCTGCGTCCTGC 1280 
CATCTGGGAGGAGTTCACGGAGCGCTTCGGCGTACGCCAA 1320
ATCGGGGAGTTCTACGGCGCCACCGAGTGCAAC TGCAGCA 1360 
TTGCCAACATGGACGGCAAGGTCGGCTCCTGTGGTTTCAA 1400 

1410 1420 1430 1440



CAGCCGCATCCTGCCCC ACGTGTACCCCATCCGGCTGGTG 1440 
AAGGTCAATGAGGACACAATGGAGCTGCTGCGGGATGCCC 1480 
AGGGCCTCTGCATCCCCTGCCAGGCCGGGGAGCCTGGCCT 1520
CCTTGTGGGTCAGATCAACCAACAGGACCCGCTGCGCCGC 1560 
TTCGATGGCTATGTCAGCGAGAGCGCCACCAGCAAGAAGA 1600 

1610 1620 1630 1640 


TCGCCCACAGCGTCTTCAGCAAGGGCGACAGCGCCTACCT 1640 
CTCAGGTGACGTGCTAGTGATGGATGAGCTGGGCTACATG 1680 
TACTTCCGGGACCGTAGCGGGGACACCTTCCGCTGGCGAG 1720 
GGGAGAACGTCTCCACCACCGAGGTGGAGGGCGTGCTGAG 1760
CCGCCTGCTGGGCCAGACAGACGTGGCCGTCTATGGGGTG 1800 

1810 1820 1830 1840 



GCTGTTCCAGGAGTGGAGGGTAAGGCAGGGATGGCGGCCG 1840 
TCGCAGACCCCCAC AGCCTGCTGGACCCCAACGCGATATA 1880 
CCAGGAGCTGCAGAAGGTGC TGGC ACCCT ATGCCCGGCCC 1920 
ATCTTCCTGCGCCTCCTGCCCCAGGTGGACACCACAGGCA 1960 
CCTTCAAGATCCAGAAGACGAGGCTGCAGCGAGAGGGCTT 2000 

2010 2020 2030 2040 


TGACCCACGCCAGACCTCAGACCGGCTCTTCTTCCTGGAC 2040 
CTGAAGCAGGGCCACTACCTGCCCTTAAATGAGGCAGTCT 2080 
ACACTCGCATCTGCTCGGGCGCCTTCGCCCTCTGAAGCTG 2120
TTCCTCTACTGGCCACAAACTCTGGGCCTGGTGGGAGAGG 2160 
CCAGCTTGAGCCAGACAGCGCTGCCCAGGGGTGGCCGCCT 2200 

2210 2220 2230 2240 




AGTACACACCCACCTGGCCGAGCTGTACCTGGCACGGCCC 2240 
ATCCTGGACTGAGAAACTGGAACCTCAGAGGAACCCGTGC 2280 
CTCTCTGCTGCCTTGGTGCCCCTGTGTCTGCCTCCTCTCC 2320 
CTGCTTTTCAGCCTCTGTCTCCTTCCATCCCTGTCCCTGT 2360 
CTGGCCTTAACTCTTCCCTCTCTTTCTTTTCTTTCT.iC. 2400 

2410 2420 2430 2440 




TTCTTTTTTTTTAAGATAGAGTCTCACTCTGCTGCCCGGG 2440 
CTAGAGTGCAGTGGTGGGATCTCGGC TCACTGCAACCTCT 2480 
GCCTCCTGGGGTTCAAGTGATCCTCCCACCTCAGCCTCCT 2520 
GAGTAGCTGGGATTACAGGCACCCGCCACCACGTCCAGCT 2560 
AATTTTTATATTTTTAGT AGAGACGGGGTTTCACC ATGTT 2600 




hsFATP1 full length.DNA

2610 2620 2630 2640 

GGTCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCGC 2640 
TGGCCTCGGCCTCCCAGAGTGCTGGGATTATAGGCGTGAG 2680 
CCTCTGGCCCGGCCTTTCCTTTTTCCTCTCCTCTCCTGCC 2720 
GAGAGTGGAACACACGTGTCCTGGGAGCTGCATCTTGTGT 2760 
AGGGTCCAGCTGCTTTTGGGGACTGCAGGAATCATCTCCC 2800 

2810 2820 2830 2840 


CTGGGCCCTGGACTCGGACTGGGGCCTCCCCACCTCCCTC 2840 
TCGGCTGTGCCTTACGGAGCCCCAATCCAGGCCrCCTGTG 2880 
GCTGTTGGGTTCCAGATGCTGCAGCTCCATGTGACTTCCA 2920 
AGCAGGCCCTCCGCCCTCCCTGCTGAATGGAGGAGCCGGG 2960 
GGTCCCCCAGGCCAACTGGAAAATCTCCCAGGCTAGGCCA 3000 

3010 3020 3030 3040 


ATTGCCTTTTGCACTTCCCCGTTCCTGTCACATTTCCCCA 3040 
GCCCCACCTTCCCCTCCTGATGCCCTGAAAGCTTCCGGAA 3080 
TTGACTGTGACCACTTGGATGTCACCACTGTCAGCCCCTG 3120 
CCTTGATGTCCCCATTTAGCCATCTCCATGGAGCTCCTGC 3160 
TGGAGGGCCCTGAACCCTGCACTGCGTGGCTGCCCAGCCA 3200 

3210 3220 3230 3240 


GCTGCCTCCTGTCCTGGGAGGAGGCCTCCTGGGTGTCCTC 3240 
ATCTGGTGTGTCTACTGGAGGGTCCCACAGGAGAGGCAGC 3280 
AGAGGGGTCAGGGGAGGTCTCCTGCCGGGGGTTGGCCTCT 3320 
CAAGCCTCAGGGGTTCTAGCCTGTTGAATATACCCCACCT 3360 
GGTGGGTGGCCCCTCCGATGTCCCCACTGATGGCTCTGAC 3400 

3410 3420 3430 3440 


ACCGTGTTGGTGGCGATGTCCCAGACAATCCCACCAGGAC 3440 
GGCCCAGACATCCCTACTGGCTTCGCTGGTGGCTCATCTC 3480 
GAACATCCACGCCAGCCTTTC TGGGGCCGGCCACCCAGGC 3520 
CGCCTGTCCGTCTGTCCTCCCTCCAGCAGCACCCCCTGGC 3560 
CCCTGGAGTGGTGGGGCCATGGCAAGAGACACCGTGGCGT 3600 

3610 3620 3630 3640 


CTCATGTGAACTTTCCTGGGCACTGTGGTTTTATTTCCTA 3640 
ATTGATTTAAGAAATAAACCTGAAGACCGTCTGGTGAAAA 3680 
AAAAAAAAAAAAAA 3694 



fu. <HC 






hsFATP1 full length.protein



10 20 30 40



NLRREPLAFCLGTSGAKALIFGGEMVAAVAEVSGHLGKSL 200 

210 220 230 240 



KFCSGDLGPEG LPOTHLLDPLLKEASTAPLAO I PSKGM 240 
DORLF Y I YTSGTTGLPKAA I VVHSRYYRMAAFGHHAYRMQ 280 
Iahvi vnCLPLYHSAGNI I GVGQCL I YGLT VVLRKKFSAS 320 
™^KYNCTVV$Y!GEICRYLLKQPVREAERRHRVRC 
AVgSg[rpIiWEEF?ERFGVRQIGEFYGATECNCSIANMO 400 




410 420 430 440 



GKVGSCGFNSR 1 LPHVYP I RLVKVNEDTMELLRDAQGLC I 440 
orniPPPrL LVGQ INQQDPLRRFOGYVSESATSKK I AHSV 480 
FSKGdIaYLSGoSlVmSeLGYMYFRDRSGDTFRWRGE^ 520 
TTEVEGVLSRLLGQTDVAVYGVAVPGVEGKAGMAAVAOPH 560 
SLLDPNA I YQELQKVLAPYARP I FLRLLPOVDTTGTFK IQ 600 
610 620 630 640 


KTRLQREGFDPRQTSORLFFLDLKQGHYLPLNEAVYTR I C 640 

SGAFAL. 647 
hsVLACS full length.DNA



10 20 30 40 

GGAATTCCAAAAAAAAAAAATACGACTACACCTGCTCCGG 40 
AGCCCGCGGCGGTACC TGCAGCGGAGGAGCTCTGTCTTCC 80 
CCTTCATCTCACGCGAGCCCGGCGTCCCGCCGCGTGCGCC 120 
CCGGCGCAGCCCGCCAGTCCGCCCGGAGCCCGCCCAGTCG 160 
CCGCGCTGCACGCCCGGGGTGAACCCTCTGCCCTCGCTGG 200 

210 220 230 240

GACAGAGGGCCCCGCAGCCGTCATGCTTTCCGCCATCTAC 240 
ACAGTCCTGGCGGGACTGCTGTTCCTGCCGCTCCTGGTGA 280 
ACCTCTGCTGCCC ATACTIC TTCCAGGACATAGGCTACTT 320 
CTTGAAGGTGGCCGCCGTGGGCCGGAGGGTGCGCAGCTAC 360 
GGGCAGCGGCGGCCGGCGCGCACCATCCTGCGGGCGTTCC 400 

410 420 430 440 


TGGAGAAAGCGCGCC AGACGCCACACAAGCCTTTTCTGCT 440 
CTTCCGCGACGAGACTCTCACCTACGCGCAGGTGGACCGG 480 
CGCAGCAATCAAGTGGCCCGGGCGCTGCACGACCACCTCG 520 
GCCTGCGCCAGGGAGACTGCGTGGCGCTCCTTATGGGT.AA 560 
CGAGCCGGCCTACGTGTGGC TGTGGCTGGGGCTGGTGAAG 600 

610 620 630 640 


CTGGGCTGTGCCATGGCGTGCCTCAATTACAACATCCGCG 640 
CGAAGTCCCTGCTGCACTGCTTCCAGTGCTGCGGGGCGAA 680 
GGTGCTGCTGGTGTCGCCAGAACTACAAGCAGCTGTCGAA 720 
GAGATACTGCCAAGCCTTAAAAAAGATGATGTGTCCATCT 760 
ATTATGTGAGCAGAACTTC TAACACAGATGGGATTGACTC 800 

810 820 830 840 

TTTCCTGGACAAAGTGGATGAAGTATCAACTGAACCTATC 840 
CCAGAGTCATGGAGGTCTGAAGTCACTTTTTCCACTCCTG 880 
CCTTATACATTTATACTTCTGGAACCAC AGGTCTTCCAAA 920 
AGCAGCCATGATCAC TCATC AGCGCAT ATGGTATGGAAC T 960 
GGCCTCACTTTTGTAAGCGGATTGAAGGCAGATGATGTCA 1000 

1010 1020 1030 1040 


TCTATATCACTCTGCCCTTTTACCACAGTGCTGCACTACT 1040 
GATTGGCATTCACGGATGTATTGTGGC TGGTGC TACTC TT 1080 
GCCTTGCGGACTAAATTTTCAGCCAGCCAGTTTTGGGATG 1 120 
ACTGCAGAAAATACAACGTCACTGTCATTCAGTATATCGG 1 160 
TGAACTGCTTCGGTATTTATGC AACTCACCACAGAAACC A 1200 
hsVLACS full length.DNA

1210 1220 1230 1240 


AATGACCGTGATCATAAAGTGAGACTGGCACTGGGAAATG 1240 
GCTTACGAGGAGATGTGTGGAGAC AATTTGTCAAGAGA7T 1280 
TGGGGACATATGCATC TATGAGTTCTATGCTGCCACTGAA 1320 
GGCAATATTGGATTTATGAA TTATGCGAGAAAAGTTGGTG 1360 
CTGTTGGAAGAGTAAACTACCTACAGAAAAAAATCATAAC 1400 

1410 1420 1430 1440 


TTATGACCTGATTAAATATG ATGTGGAGAAAGATGAACCT 1440 
GTCCGAGATGAAAATGGATATTGCGTCAGAGTTCCCAAAG 1480 
GTGAAGTTGGACTTCTGGTTTGCAAAATCACACAACTTAC 1520 
ACCATTTAATGGCTATGCTGGAGCAAAGGCTCAGACAGAG 1560 
AAGAAAAAACTGAGAGATGTCTTTAAGAAAGGAGACCTCT 1600 

1610 1620 1630 1640 


ATTTCAACAGTGGAGATCTC TT AATGGTTGACCATGAAAA 1640 
TTTCATCTATTTCCACGACAGAGTTGGAGATACAT TCCGG 1680 
TGGAAAGGGGAAAATGTGGCCACC ACTGAAGTTGCTGATA 1720 
CAGTTGGACTGGTTGATTTTGTCC AAGAAGTAAATGTTTA 1760 
TGGAGTGCATGTGCCAGATCATGAGGGTCGCATTGGCATG 1800 

1810 1820 1830 1840 


GCCTCCATCAAAATGAAAGAAAACCATGAATTTGATGGAA 1840 
AGAAACTCTTTCAGCACATTGCTGATTACCTACCTAGT7A 1880 
TGCAAGGCCCCGGTTTCTAAGAATACAGGACACCATTGAG 1920 
ATCACTGGAACTTTTAAACACCGCAAAATGACCCTGGTGG 1960
AGGAGGGCTTTAACCCTGCTGTCATCAAAGATGCCTTGTA 2000 

2010 2020 2030 2040 


TTTCTTGGATGACACAGCAAAAATGTArGTGCCTATGACT 2040 
GAGSACATCTATAATGCCATAAGTGCTAAAACCCTGAAAC 2080 
TCTGAATATTCCCAGGAGGATAACTCAACATTTCCAGAAA 2120 
GAAACTGAATGGACAGCC ACTTGATATAATCCAACTTTAA 2160 
T TTG ATTGAAGATTG TG AGG AA AT TTTGTAGG A A AT TTGC 2200 

2210 2220 2230 2240 


ATACCCGTAAAGGGAGACTTTTTTAAATAACAGTTGAG7C 2240 
TTTGCAAGTAAAAAGATTTAGAGATTATTATT TTTCAGTG 2280 
TGCACCTACTGTTTGTATTTGCAAACTGAGCTTGT i GGAG 2320 
GGAAGGCATTATT7TTTAAAA7AC TTAGTAAATTAAATGA 2360 

AC 2362 
hsVLACS full length.protein



10 20 30 40 


MLSAl YTVLAGLLFLPLLVNLCCPYFFQD IGYFLKVAAVG 40 
RRVRSYGQRRPART I LRAFLEKARQTPHKPFLLFRDETLT 80 
YAQVDRRSNQVARALHOHLGLRQGDCVALLMGNEPAYVWL 120 
WLGLVKLGCAMACLNYN I RAKSLLHCFOCCGAKVLLVSPE 160 
LQAAVEE i LPSLKKDDVS I YYVSRTSNTDG I OSFLDKVOE 200 
210 220 230 240 


VSTEP I PESWRSEVTFSTPALY I YTSGTTGLPKAAM I THQ 240 
R I WYGTGL TFVSGLK ADDV I Y I TLPFYHSAALL I G I HGC I 280 
VAGATLALRTKFSASQFWDDCRKYNVTV I QY I GELLRYLC 320 
NSPQKPNDRDHKVRLALGNGLRGDVWRGFVKRFGOI C I YE 360 
FYAATEGN I GFMNYARK VGAVGRVNYLGKK I t TYDL I KYD 400 

410 420 430 440 


VEKDEPVRDENGYCVRVPKGEVGLLVCK I TQLTPFNGYAG 440 
AKAOTEKKKLRDVFKKGDLYFNSGOLLMVDHENF I YFHDR 480 
VGDTFRWKGENVATTEVAOTVGLVOF VQEVNVYGVHVPOH 520 
EGR IGMAS I KMKENHEFOGKKLFQH I ADYLPSYARPRFLR 560 
I GOT I E ! TGTFKHRKMTLVEEGFNPAV I KOALYFLDDTAK 600 

610 620 630 640 


MYVPMTEO I YNAI SAKTLKL. 621 

hsFATP3 partial.DNA 



10 20 30 40 


AAGTTCTCGGCTGGTCAGTTCTGGGAAGATTGCCAGCAGC 40 
ACAGGGTGACGGTGTTCCAGTACATTGGGGAGCTGTGCCG 80 
ATACCTTGTCAACCAGCCCCCGAGCAAGGCAGAACGTGGC 120 
CATAAGGTCCGGCTGGCAGTGGGCAGCGGGC TGCGCCCAG 160 
ATACCTGGGAGCGTTTTGTGCGGCGCTTCGGGCCCCTGCA 200 

210 220 230 240 


GGTGCTGGAGACATATGGAC TGACAGAGGGCAACGTGGCC 240 
ACCATCAACTACACAGGAC AGCGGGGCGCTGTGGGGCGTG 280 
CTTCCTGGCTTTACAAGCATATCTTCCCCTTCTCCTTGAT 320 
TCGCTATGATGTCACC ACAGGAGAGCCAATTCGGGACCCC 360 
CAGGGGCACTGTATGGCCAC ATCTCCAGGTGAGCCAGGGC 400 

410 420 430 440 


TGCTGGTGGCCCCGGTAAGCCAGCAGTCCCC ATTCCTGGG 440 
C TATGCTGGCGGGCCAGAGC TGGC CC AGGG G A AG TTGCT A 480 
AAGGATGTCTTCCGGCCTGGGGATGTTTTCTTCAACACTG 520 
GGGACCTGCTGGTCTGCGATGACCAAGGTTTTCTCCGCTT 560 
CCATGATCGTACTGGAGACACCTTCAGGTGGAAGGGGGAG 600 

610 620 630 640 


AATGTGGCCACAACCGAGGTGGCAGAGGTCTTCGAGGCCC 640 
TAGATTTTCTTCAGGAGGTGAACGTCTATGGAGTCACTGT 680 
GC C AGGG CATGAAGGC AGGG C TGGAA TGGC AGCCC I AGTT 720 
CTGCGTCCCCCCCACGCTTTGGACCTTATGCAGCTCTACA 760 
CCCACGTGTCTGAGAACTTGCCACCTTATGCCCGGCCCCG 800 

810 820 830 840 


ATTCCTCAGGCTCCAGGAGTCTTTGGCCACCACAGAGACC 840 
TTCAAACAGCAGAAAGTTCGGATGGCAAATGAGGGCTTCG 880 
ACCCCAGCACCCTGTCTGACCCACTGTACGTTCTGGACCA 920 
GGCTGTAGGTGCCTACCTGCCCC TC ACAACTGCCCGGTAC 960 
AGCGCCCTCCTGGC AGGAAACCTTCGAATCTGAGAACTTC 1000 

1010 1020 1030 1040 


CACACCTGAGGCACCTGAGAGAGGAACTCTGTGGGGTGGG 1040 
GGCCGTTGCAGGTGTACTGGGCTGTCAGGGATCTTTTCTA 1080 
TACCAGAACTGCGGTCAC TATTTIGTAATAAATGIGGCTG 1 120 
GAGCTGATCCAGC TGTCTC TGAC AAAAAAAAAAAAAAAAA 1 160 
AAAGGGCGGCCGC 1 173 

hsFATP3 partial.protein



10 20 30 40

KFSAGQFWEOCQQHRVTVFQY I GELCRYLVNOPPSKAERG 40 
HKVRLAVGSGLRPDTWERFVRRFGPLQVLETYGLTEGNVA 80 
T I NYTGQRGAVGRA5WLYKH I FPFSL I RYDVTTGEP I ROP 120 
QGHCMATSPGEPGLLVAPVSQOSPFLGYAGGPELAQGKLL 160 
KDVFRPGDVFFNTGDLLVCODQGFLRFHORTGDTFRWKGE 200 

210 220 230 240 


NVATTEVAEVFEALDFLOEVNVYGVTVPGHEGRAGMAALV 240 
LRPPHALDLMQLYTHVSENLPPYARPRFLRLQESLATTET 280 
FKQQKVRMANEGFDPSTLSDPLYVLOQAVGAYLPLTTARY 320 
SALLAGNLRI. 331 
hsFATP4 full length



10 20 30 40



CGACCCACGCGTCCGGGCGGGCGGGGCCGGGCGGCGGGCG 40 
GGGCTGGCGGGGCGGCCGGGCCATGCAGGGCGCAGAGCCG 80 
GCTAAACCCTGCTGAGACCCGGCTCCGTGCGTCCAGGGGC 120
rrcTAATGCCCCTCACGCTGTCTACGCTGCTGCAACCGGG 160 
CCGCATCTGGACGGGGCGCCGCGCGGCGGAGCCGACGCCG 200 



210 220 230 240 




GGCCACAATGCTGCTTGGAGCCTCTCTGGTGGGGGTGCTG 240 
CTGTTCTCCAAGCTGGTGCTGAAACTGCCCTGGACCCAGG 280 
TGGGATTCTCCCTGTTGTTCCTCTACTTGGGATCTGGCGG 320 
CTGGCGCTTCATCCGGGTCTTCATCAAGACCATCAGGCGC 360 

gItaktttggcggcctggtcctcctgaaggtgaaggcaa 400 

410 420 430 440 


ArrTGCGACAGTGCCTGCAGGAGCGGCGGACAGTGCCCAT 440 
???£mGCCTCTACC6TTCGGC « 80 

GCCAGCTGGATGAGTACTCAAGCAGTGTAGCCAAC i TCCT 560 
GCAGGCCCGGGGCCTGGCCTCGGGCGATGTGGCTGCCATC 600 

610 620 630 640 




TTCATGGAGAACCGCAATGAGTTCGTGGGCCTATGGCTGG 640 
GCATGGCCAAGCTCGGTGTGGAGGCAGCCCTCATCAACAC 680 
CAACCTGCGGCGGGATGCTCTGCTCCACTGCCTCACCACC 720 
TrGCGCGCACGGGCCCTTGTCTTTGGCAGCGAAATGGCCT 760 
CAGCCATCTGTGAGGTCCATGCC AGCCTGGACCCCTCGCT 800 



810 820 830 840



CAGCCTCTTCTGCTCTGGCTCCTGGGAGCCCGGTGCGGTG 840 
CCTCCAAGCACAGAACACCTGGACCCTCTGCTGAAAGATG 880 
CTCCCAAGCACCTTCCCAGTTGCCCTGACAAGGGCTTCAC 920 
AGATAAACTGTTCT ACATCTACACATCCGGCACCACAGGG 960 
CTGCCCAAGGCCGCCATCGTGGTGCACAGCAGGTATTACC 1000 



1010 1020 1030 1040 




GCATGGCTGCCCTGGTGTACTATGGATTCCGCATGCGGCC 1040
CAACGACATCGTCTATGACTGCCTCCCCCTCTACCACTCA 1080
GCAGGAAACATCGTGGGAATCGGCCAGTGCCTGCTGCATG 1120
GCATGACGGTGGTGATTCGGAAGAAGTTCTCAGCCTCCCG 1160
GTTCTGGGACGATTGTATCAAGTACAACTGCACGATTGTG 1200 




hsFATP4 full length 

1210 1220 1230 1240 


CAGTACATTGGTGAACTGTGCCGCTACCTCCTGAACCAGC 1240 
CACCGCGGGAGGCAGAAAACCAGCACCAGGTTCGCATGGC 1280 
ACTAGGCAATGGCCTCCGGCAGTCCATCTGGACCAACTTT 1320 
TCCAGCCGCTTCCACATACCCCAGGTGGCTGAGTTCTACG 1360 
GGGCCACAGAGTGCAACTGTAGCCTGGGCAACTTCGACAG 1400 
1410 1420 1430 1440 


CCAGGTGGGGGCCTGTGGTTTCAATAGCCGCATCCTGTCC 1440 

TTCGTGTACCCCATCCGGTTGGTACGTGTCAACGAGGACA 1480 

CCATGGAGCTGATCCGGGGGCCCGACGGCGTCTGCATTCC 1520

CTGCCAGCCAGGTGAGCCGGGCCAGCTGGTGGGCCGCATC 1560 

ATCCAGAAAGACCCCCTGCGCCGCTTCGATGGCTACCTCA 1600 

1610 1620 1630 1640 


ACCAGGGCGCCAACAACAAGAAGATTGCCAAGGATGTCTT 1640 
CAAGAAGGGGGACCAGGCCTACCTTACTGGTGATGTGCTG 1680 
GTGATGGACGAGCTGGGCTACCTGTACTTCCGAGACCGCA 1720 
CTGGGGACACGTTCCGCTGGAAAGGTGAGAACGTGTCCAC 1760 
CACCGAGGTGGAAGGCACACTCAGCCGCCTGCTGGACATG 1800 

1810 1820 1830 1840 


GCTGACGTGGCCGTGTATGGTGTCGAGGTGCC AGGAACCG 1840 
AGGGCCGGGCCGGAATGGCTGCTGTGGCCAGCCCCACTGG 1880 
CAACTGTGACCTGGAGCGC TTTGCTCAGGTCTTGGAGAAG 1920 
GAACTGCCCCTGTATGCGCGCCCCATCTTCC TGCGCCTCC 1960 
TGCCTGAGCTGCACAAAACAGGAACCTACAAGTTCCAGAA 2000 

2010 2020 2030 2040 


GACAGAGCTACGGAAGGAGGGCTTTGACCCGGCTATTGTG 2040 
AAAGACCCGCTGTTCTATC TAGATGCCCAGAAGGGCCGCT 2080 
ACGTCCCGCTGGACCAAGAGGCCTACAGCCGCATCCAGGC 2120 
AGGCGAGGAGAAGCTGTGATTCCCCCCATCCCTCTGAGGG 2160 
CCGGCGGATGCTGGATCCGGAGCCCCAGGTTCCGCCCCAG 2200 

2210 2220 2230 2240 


AGCGGTCCTGGACAAGGCC AGACCAAAGCAAGCAGGGCCT 2240 
GGCACCTCCATCCTGAGGTGCTGCCCCTCC ATCCAAAACT 2280 
GCCAAGTGACTCATTGCCTTCCC AACCCTTCC AGAGGCTT 2320 
TCTGTGAAAGTCTCATGTCCAAGTTCCGTCTTCTGGGCTG 2360 
GGCAGGCCCTCTGGTTCCCAGGC TGAGACTGACGGGTTTT 2400 

2410 2420 2430 2440 


CTCAGGATGATGTCTTGGGTGAGGGTAGGGAGAGGACAAG 2440 
GGGTCACCGAGCCCTTCCCAGAGAGCAGGGAGCTTATAAA 2480 
TGG AACC AG AGC AG AAG TC C CC A G AC TCAGG A AG TCAAC A 2520 
GAGTGGGCAGGGACAGTGG TAGC ATCC ATC TGGTGGCCAA 2560 
AG AG AATCG T AGC C CC AG AGC TGC CCA AG TTC AC TGGGCT 2600 

hsFATP4 full length

2610 2620 2630 2640



CCACCCCCACCTCCAGGAGGGGAGGAGAGGACCTGACATC 2640 
TGTAGGTGGCCCCTGATGCCCCATCTACAGCAGGAGGTCA 2680 
GGACCACGCCCCTGGCCTCTCCCCACTCCCCCATCCTCCT 2720 
CCCTGGGTGGCTGCCTGATTATCCCTCAGGCAGGGCCTCT 2760 
CAGTCCTTGTGGGTCTGTGTCACCTCCATCTCAGTCTTGG 2800 

2810 2820 2830 2840 


CCTGGCTATGAGGGGAGGAGGAATGGGAGAGGGGGCTCAG 2840 
GGGCCAATAAACTCTGCC TTGAGTCCTCCTAAAAAAAAAA 2880 
AAAAAAAAAAAAAAAAAAAAAAAAAAA 2907 
hsFATP4 full length, protein



10 20 30 40



MLLGASL"GVLLFSKL VLKLPWTQVGFSLLFLYLGSGGWR 40 
F IRVF IKT I RRO t FGGLVLLKVKAKVRQCLQERRTVPILF 80 
ASTVRRHPDKTALIFEGTDTHWTFRQLDEYSSSVANFLQA 120
RGLASGOVAA I FMENRNEFVGLWLGMAKLGVEAAL 1NTNL 1 60 
RRDALLHCLTTSRARALVFGSEMASA I CEVHASLOPSLSL 200 
210 220 230 240

NGLRQS IWTNFSSRFH I PQVAEFYGATECNCSLGNFOSQ V 400 
410 420 430 440

GACGFNSR ILSFV YP I RLVRVNEDTMEL IRGPDGVC IPCO 440 
PGEPGQLVGR I IQKDPLRRFOGYLNQGANNKK I AKOVFKK 480 
GDQAYLTGDVLVMOELGYLYFRDRTGOTFRWKGENVSTTE 520 
VEGTLSRLLDMAOVAVYGVEVPGTEGRAGMAAVASPTGNC 560 
OLERFAQVLEKELPLYARP IFLRLLPELHKTGTYKFQKTE 600 

610 620 630 640 


LRKEGFDPAl VKOPLFYLDAQKGRY VPLDQEAYSR I QAGE 640 

EKL 643 

f !y SI 
>hsFATP5 (partial)

GTCGTTGGGATCCTCGGCTGCTTAGATCTCGGAGCCACCTGTGTTCTGGCCCCCAAG 
TTCTCTACTTCCTGCTTCTGGGA 

TGACTGTCGGCAGCATGGCGTGACAGTGATCCTGTATGTGGGCGAGCTCCTGCGATA 
CTTGTGTAACATTCCCCAGCAAC 

CAGAGGACCGGACACATACAGTCCGCCTGGCAATGGGCAATGGACTACGGGCTGAT 
GTGTGGGGAGACCTTCCAGCAGCG 

TTTCGGTCCTATTTCGGATCTNGGGAAGTCTTACGGGCTTCCACAGAAGGGCAACAT 
GGGGCTTTAGTTCAAATATTGTT 

GGGGGCGCTGCGGGGCCCTGGGGGCAAAGATGGAGCITGCCTCCTCCGAATGCTGT 
CCCCCTTTGAGCTGGTGCAGTTCG 

ACATGGAGGCGGCGGAGCCTGTGAGGGACAATCAGGGCTTCTGCATCCCTGTAGGG 
CTAGGGGAGCCGGGGCTGCTGTTG 

ACCAAGGTGGTAAGCCAGCAACCCTTCGTGGGCTACCGCGGCCCCCGAGAGCTGTC 
GGAACGGAAGCTGGTGCGCAACGT 

GCGGCAATCGGGCGACGTTTACTACAACACCGGGGACGTACTGGCCATGGACCGCG 
AAGGCTTCCTCTACTTCCGCGACC 

GACTCGGGGACACCTTCCGATGGAAGGGCGAGAACGTGTCCACGCACGAGGTGGAG 
GGCGTGTTGTCGCAGGTGGACTTC 

TTGCAACAGGTTAACGTGTATGGCGTGTGCGTGCCAGGTTGTGAGGGTAAGGTGGGC 
ATGGCTGCTGTGGCATTAGCCCC 

CGGCCAGACTTTCGACGGGGAGAAGTTGTACCAGCACGTTCGCGCTTGGCTCCCrrGC 
CTACGCTACCCCCCATTTCATCC 

GCATCCAGGACGCCATGGAGGTCACCAGCACGTTCAAACTGATGAAGACCCGGTTG 
GTGCGTGAGGGCTTCAATGTGGGG 

ATCGTGGTTGACCCTCTGTTTGTACTGGACAACCGGGCCCAGTCCTTCCGGCCCCTG 
ACGGCAGAAATGTACCAGGCTGT 

GTGTGACKjGAACCTGGAGGCTCTGATCACCTGGCCAACCCACTGGGGTAGGGATCA 
AAGCCAGCCACCCCCACCCCAACA 

CACTCGGTGTCCCTrTTCATCCTGGGCCTGTGTGAATCCCAGCCTGGCCATACCCTCA 
ACCTCAGTGGGCTGGAAATGACA 

GTGGGCCCTGTAGCAGTGGCAGAATAAACTCAGMTGYGTTCACAGAAA 
hsFATP5 partial.protein



10 20 30 40 




VVG ILGCLDLGATCVLAPKFSTSCFWOOCRQHGVTV ILYV 40 
GELLRYLCN I PQQPEORTHTVRLAMGNGLRADVWGDLPAA 80 
FRSYFGSXEVLRASTEGOHGALVQI LLGALRGPGGKDGAC 120 
LLRMLSPFELVQFOMEAAEPVRDNQGFC I PVGLGEPGLLL 160 
TKVVSQQPFVGYRGPRELSERKLVRNVRQSGDVYYNTGOV 200 

210 220 230 240 



LAMDREGFLYFRDRLGDTFRWKGENVSTHEVEGVLSQVDF 240 
LQQVNVYGVCVPGCEGKVGMAAVALAPGQTFOGEKLYQHV 280 
RAWLPAYATPHF I R I QDAMEVTSTFKLMKTRLVREGFNVG 320 
I V VDPLFVLDNRAQSFRPLTAEMYQAVCEGTWRL 354 
hsFATP6 full length.DNA



10 20 30 40 


AACGGCAAGTAAGCGCAACGCAATTAATGTGAGTAGCTCA 40 
CTCATTAGGCACCCCAGGCTTTAC ACTTTATGCTTCCGGG 80 
CTCGTATGTTGTGTGGAATTGTGAGCGGATACCAATTTCA 120 
CACAGGAACCAGCTATGACATGATTACGAATTTAATACGA 160 
CTCACTATAGGGAATTTGGCCCTCGAGGCCAAGAATTCGG 200 

210 220 230 240 


CACGAGGGGTGCTGAGCCCCTGCGCGGTTTCTGGTGCGTA 240 
GAGACTGTAAATCGCTGCGC TTCTCAGTCATCATCATCCC 280 
AGCTTTTCCCGGCTCGAATTCAGCCTCCAACTCAAGCTCG 320 
CGGGAAAGACTACCTGAGAGGAGAAAAGCTTCTGTCCCTG 360 
GACCTTCrrCTGAGGGTGGAGTCGGAGGCTCCCTGCTTTC 400 

410 420 430 440 


CAGCCGCCCAGTGACCCAAGCTTAATCTTCAGCACCACTT 440 
GGGGCGACCTTTTCGGTGCAAACCTACGA7TCTGTTTCTC 480 
AGGATTCCTCCCCATCCCGCTTCGCCCCGGAAAAGCTGAC 520 
AAGAACTTCAGGTGTAAGCCCTGAGTAGTGAGGATCTGCG 560 
GTCTCCGTGGAGAGCTGTGCCTGGAAGAGAAGGACGCTGG 600 

610 620 630 640 


TGGGGGCTGAGATCAGAGCTGTCTTCTGGCCCAGTTGCCC 640 
CC ATGC TTC TG TC A TGGCT A AC AG TTCTAGGGGCTGGA AT 680 
GGTCGTCC TGCACTTC TTGCAGAAACTCCTGTTCCC TTAC 720 
TTTTGGGATGAC TTCTGGTTCGTGTTGAAGGTGGTGC7CA 760 
TTATAATTCGGCTGAAGAAGTATGAAAAGAGAGGGGAGCT 800 

810 820 830 840

GGTGACTGTGCTGGATAAATTCTTGAGTCATGCCAAAAGA 840 
CAACCTCGGAAACCTTTCATCATCTATGAGGGAGACATCT 880 
ACACCTATCAGGATGTAGACAAAAGGAGCAGCAGAGTGGC 920 
CCATGTCTTCCTGAACCATTCCTCTCTGAAAAAGGGGGAC 960 
ACGGTGGCTCTGCTGATGAGCAATGAGCCGGACTTCGTTC 1000 

1010 1020 1030 1040 


ACGTGTGGTTCGGCCTCGCCAAGCTGGGCTGCGTGGTGGC 1040 
CTTTCTCAACACCAACATTCGCTCC AAC TCCCTCCTGAAT 1080 
TGCATCCGCGCCTGrGGGCCCAGAGCCCTAGTGGTGGGCG 1 120 
CAGATTTGCTTGGAACGGT AGAAGAAATCCTTCC AAGCCT 1 160 
CTC AG A A AA T A TC AG TG T T TGG GGG A TG A AAG A TTC TG TT 1200 



Fv,. StA 




hsFATP6 full length.DNA

1210 1220 1230 1240 


CCACAAGGTGTAATTTCACTCAAAGAAAAACTGAGCACCT 1240 
CACCTGATGAGCCCGTGCCACGCAGCCACCATGTTGTCTC 1280 
ACTCCTCAAGTCTACTTGTCTTTACATTTTTACCTCTGGA 1320 
ACAACAGGTCTACCAAAAGC AGCTGTGATTAGTCAGCTGC 1360 
AGGTTTTAAGGGGTTCTGCTGTCCTGTGGGCTTTTGGTTG 1400 

1410 1420 1430 1440 


TACTGCTCATGACATTGTTTATATAACCCTTCCTCTGTAT 1440 
CATAGTTCAGCAGCTATCCTGGGAATTTCTGGATGTGTTG 1480 
AGTTGGGTGCCACTTGTGTGTTAAAGAAGAAATTTTCAGC 1520 
AAGCCAGTTTTGGAGTGACTGCAAGAAGTATGATGTGACT 1560 
GTGTTTCAGTATATTGGAGAACTTTGTCGCTACCTTTGCA 1600 

1610 1620 1630 1640 


AACAATCTAAGAGAGAAGGAGAAAAGGATCATAAGGTGCG 1640 
TTTGGCAATTGGAAATGGCATACGGAGTGATGTATGGAGA 1680
GAATTTTTAGACAGATTTGGAAATATAAAGGTGTGfGAAC 1 720 
TTTATGCAGCTACCGAATCAAGCATATCTTTCATGAACTA 1760 
CACTGGGAGAATTGGAGCAATTGGGAGAACAAATTTGTTT 1800 

1810 1820 1830 1840 


TACAAACTTCTTTCCACTTTTGACTTAATAAAGTATGACT 1840 
TTCAGAAAGATGAACCCATGAGAAATGAGCAGGGTTGGTG 1880 
TATTCATGTGAAAAAAGGAGAACCTGGACTTCTCATTTCT 1920 
CGAGTGAATGCAAAAAATCCCTTCTTTGGCTATGCTGGGC 1960 
CTTATAAGCACACAAAAGACAAATTGCTTTGTGATGTTTT 2000 

2010 2020 2030 2040 


TAAGAAGGGAGATGTTTACCTTAATACTGGAGACTTAATA 2040 
GTCCAGGATCAGGACAATTTCCTTTATTTTTGGGACCGTA 2080 
CTGGAGACACTTTC AGATGGAAAGGAGAAAATGTCGCAAC 2120 
CACTGAGGTTGCTGATGTTATTGGAATGTTGGATTTCATA 2160 
CAGGAAGCAAACGTCTATGGTGTGGCTATATCAGGTTATG 2200 

2210 2220 2230 2240 


AAGGAAGAGCAGGAATGGCTTCTATTATTTTAAAACCAAA 2240 
TACATCTTTAGATTTGGAAAAAGTTTATGAACAAGTTGTA 2280 
ACATTTCTACCAGCTTATGCTTGTCCACGATTTTTAAGAA 2320 
TTCAGGAAAAAATGGAAGCAACAGGAACATTCAAACTATT 2360 
GAAGCATCAGTTGGTGGAAGATGGATTTAATCCACTGAAA 2400 

2410 2420 2430 2440 


ATTTCTGAACCACTTTACTTC ATGGATAACTTGAAAAAGT 2440 
CTTATGTTCTACTGACCAGGGAACTTTATGATCAAATAAT 2480 
GTTAGGGGAAATAAAACTTTAAGATTTTTATATCTAGAAC 2520 
TTTCATATGCTTTCTTAGGAAGAGTGAGAGGGGGGTATAT 2560 
GATTCT TTATGAAATGGGGAAAGGGAGCTAACATTAATTA 2600 
hsFATP6 full length.DNA

2610 2620 2630 2640 


TGCATGTACTATATT7CCTTAATATGAGAGATAATTTTTT 2640 
AATTGCATAAGAATTTTAATTTCTTTTAATTGATATAAAC 2680 
ATTAGTTGATTATTCTTTTTATCTATTTGGAGATTCAGTG 2720 
CATAACTAAGTATTTTCCTTAATACTAAAGATTTTAAATA 2760 
ATAAATAGTGGCTAGCGGTTTGGACAATCACTAAAAATGT 2800 

2810 2820 2830 2840 


ACTTTCTAATAAGTAAAATTTCTAATT7TGAATAAAAGAT 2840 
TAAATTTTACTGAAAAAAAAAAAAAAAAAAAAAATTGGCG 2880 
GCCGC 2885 



Fid- 
hsFATP6 full length.protein



10 20 30 40 




40 
Y 80 



MLLSWLTVLGAGMVVLHFLQKLLFPYFWDDFWFVLKVVLI 
tIRLKKYEKRGELVTVLDKFLSHAKRQPRKPFIlYEGDIY 
TYQ^VDKRSSRVAHVFLNHSS 20 
VWFGLAKLGC WAFLNTN I ^NSLLNC 1 RACGPRAL V VGA 160 
qllqjvEE I LPSLSEN ISVWGMKDSVPQGV I SLKEKLSTS 200 
210 220 230 240 



pnrPVPRSHHVVSLLKSTCLY I FTSGTTGLPKAAV I SQLQ 240 
VLRGSAVLWAFGCT AHO IVY I TLPL YHSSAA I LG I SGC VE 280 
LGATCVLKKKFSASQFWSOCKKYOVTVFQY I GELCRYLCK 320 
QSKREGEKDHKVRLA I-GNG I RSOVWREFLORFGN I K VCEL 360 
YAATESS I SFMNYTGR I GA I GRTNLFYKLLSTFOL 400 



410 420 430 440



n^nFPMRNEQGWC HVKKGEPGLL I SR VNAKNPFFGYAGP 440 

YKHTKOKLLCDVFKKGu^ J nc anv YGVA^SGYE 520 

rrvrFRWKGENVATTE V AO V I GMLDF I QEANVYGVA I SGYE 520 
GRAGMA^f LKPNTSLDLEKVYEQVVTFLPAYACPRFLRI 560 
QEKMEATGTFKLLKHQLVEOGFNPLK I SEPLYFMONLKKS 600 
610 620 630 640 



YVULTRELY0QIMLGE1KL. 620 

fiy SS 
mFATP1 full length.DNA



10 20 30 40 


AAGTTCCCACTCCAGACTTCTGCGAGAACCCGTGAGGAAG 40 
CAGCGAGAACCGGGGGTTTGCAAGCCAGAGAAGGATGCGG 80 
ACTCCGGGAGCAGGAACAGCCTCTGTGGCCTCATTGGGGC 120 
TGCTTTGGCTTCTGGGACTTCCGTGGACCTGGAGCGCGGC 160 
GGCGGCGTTCGGTGTGTACGTGGGTAGCGGTGGCTGGCGA 200 

210 220 230 240 


TTTCTGCGTATCGTCTGCAAGACGGCGAGGCGAGACCTCT 240 
TTGGCCTCTCTGTTCTGATCCGCGTGCGGCTAGAGCTACG 280 
ACGACACCGGCGAGCAGGAGACACGATCCCACGCATCTTC 320 
CAGGCCGTGGCCCAGCGACAGCCGGAGCGCCTGGCGCTGG 360 
TAGATGCGAGTAGCGGTATCTGCTGGACCTTCGCACAGCT 400 

410 420 430 440 


AGACACCTACTCCAATGCTGTGGCCAATCTGTTCCTCCAG 440 
CTGGGCTTTGCGCCAGGCGATGTGGTGGCTGTGTTCCTGG 480 
AAGGCCGGCCCGAGTTCGTGGGACTGTGGCTGGGCCTGGC 520 
CAAGGCCGGTGTAGTGGCTGCGCTTCTCAATGTCAACCTG 560 
AGGCGGGAGCCCCTTGCCTTCTGCTTGGGCACATCAGCTG 600 

610 620 630 640 


CCAAGGCCCTCATTTATGGCGGGGAGATGGCAGCGGCGGT 640 
GGCGGAGGTGAGTGAGCAGCTGGGGAAGAGCCTGCTCAAG 680 
TTCTGCTCTGGAGATCTGGGGCCTGAGAGCGTCCTGCCTG 720 
ACACGCAGCTTCTGGACCCCATGCTTGCTGAGGCGCCCAC 760 
CACACCCCTGGCACAGGCCCCAGGCAAGGGCATGGATGAT 800 

810 820 830 840 


CGGCTATTTTACATCTATACTTCTGGGACCACCGGACTTC 840 
CTAAGGCGGCCATTGTGGTGCACAGCAGGTACTACCGCAT 880 
CGCAGCCTTCGGCCACCATTCCTACAGCATGCGGGCCAAC 920 
GATGTGCTCTATGACTGCCTACCTCTCTACCACTCAGCAG 960 
GGAACATCATGGGCGTGGGACAGTGTATCATCTACGGGTT 1000 

1010 1020 1030 1040 

AACGGTGGTACTGCGCAAGAAGTTCTCCGCCAGCCGCTTC 1040 
TGGGACGACTGTGTCAAATATAATTGCACGGTAGTGCAGT 1080 
ACATCGGTGAAATATGCCGCTACCTGCTAAGGCAGCCGGT 1120 
TCGCGATGTAGAGCGGCGGCACCGCGTGCGCCTGGCCGTG 1160 
GGTAACGGACTGCGGCCAGCCATCTGGGAGGAGTTCACGC 1200 



SCA 
mFATP1 full length.DNA 

1210 1220 1230 1240 


AGGGTTTCGGTGTGCGACAGATTGGCGAGTTCTACGGCGC 1240 
CACCGAATGCAACTGCAGCATTGCCAACATGGACGGCAAG 1280 
GTCGGCTCCTGCGGCTTCAACAGCCGTATCCTCACGCATG 1320 
TGTACCCCATCCGTCTGGTCAAGGTCAACGAGGACACGAT 1360 
GGAGCCACTGAGGGACTCCCAAGGCCTCTGCATCCCGTGC 1400 

1410 1420 1430 1440 


CAGCCCGGGGAACCTGGGCTTCTCGTGGGCCAGATCAACC 1440 
AGCAAGACCCTCTGCGGCGCTTCGATGGCTATGTTAGTGA 1480 
CAGCGCCACCAACAAGAAGATTGCCCACAGCGTGTTCCGA 1520 
AAGGGGGACAGCGCCTACCTTTCAGGTGACGTGCTAGTGA 1560 
TGGACGAGCTGGGGTACATGTACTTCCGTGACCGCAGCGG 1600 

1610 1620 1630 1640 


GGATACCTTCCGATGGCGCGGCGAGAACGTATCCACCACG 1640 
GAGGTGGAAGCCGTGCTGAGCCGCCTGTTGGGCCAGACGG 1680 
ACGTGGCTGTGTATGGAGTGGCTGTGCCAGGAGTGGAGGG 1720 
GAAAAGCGGCATGGCGGCCATTGCAGACCCCCACAACCAG 1760 
CTGGACCCTAACTCAATGTACCAGGAATTGCAGAAGGTTC 1800 

1810 1820 1830 1840 


TTGCATCCTATGCCCAGCCCATCTTCCTGCGTCTTCTGCC 1840 
CCAAGTGGATACAACAGGCACCTTCAAGATCCAGAAGACC 1880 
CGACTACAGCGTGAAGGCTTTGACCCCCGCCAGACCTCAG 1920 
ACCGGCTCTTCTTTCTAGACCTGAAACAGGGACGCTACCT 1960 
ACCCCTGGATGAGAGAGTCCATGCCCGCATCTGCGCAGGC 2000 

2010 2020 2030 2040 

GACTTCTCACTCTGAGCCTGGTGAGTGGGATGGCCCTGGA 2040 
CTTGTGAGACCAGGGAGCCGGACACCCCTGTTCAGGTGTT 2080 
TCTCCTGCCTGGCCACGTGGCCAGCAGCACCTGTGGGTGC 2120 
AGGAAACTGGAACCTGAGTGGCCGGGTGTCCCTTTCCTAC 2160 
AACCCACCATGCACACATCTAGCCTCTGCCTTGGTCTTTT 2200 

2210 2220 2230 2240 


TCTCCATCTCTTTCCTCCGTGCCCAGCAGGAGCCCCACAG 2240 
ACACATTGGCTGCTGTGTCCTGCAGTGGGACCGGTGTCTA 2280 
GGGGTCCATGCTGCAGGCTGTGACCCGCACTGGTGCCCAC 2320 
CTCCCTTCCCCATTGTGCCTTAGGTTCCTCCACTGTGCGC 2360 
CGGTGAAGCAAGTGGGGACCCACATAGCTGTTGTCCCTGC 2400 

2410 2420 2430 2440 


TGAGGGTTGGTAGCAAATGCACCCTCATGTCAGCTGGGAG 2440 
ACACATGCAGTCTCCCACTGACCCCCAATCAACTGAAGAT 2480 
ACTGTTTTGTATTATTGTTTTGAGATAGGGTCTCACTGTG 2520 
GAGGCCAAGCTGGCCTCAGGCTCACCACTCTACTGCCTCC 2560 
GGGCACCAGCCTGCAGTTTGATGACATGTATGCACTATTG 2600 
mFATP1 full length.DNA 

2610 2620 2630 2640 

TTCTAAGGGTCTTCTGAGTCCCTGCTTTCCCCTCATGTCC 2640 
TAAAACCTTCCAGAACTGACTCTGATCACTTGGATGTAGC 2680 
TAGTGTTGGCCCTGCCCACGTGTGTCAATTCAGGGGTCCC 2720 
CAGGCATCATCTCTGGAGGCCCTAACCTTGGCAAAGCTTG 2760 
GATGTCCTCACATCACAGCAGGAGACCCAGGAAGGTTGCT 2800 

2810 2820 2830 2840 


GTGGTGTCTCTTGGGCACCCCTGGCGGCAGCCGTGGACAT 2840 
GCTTCCCTGCTGTGATAGCCCAAACTGTTGCCTATGACAT 2880 
TTGAGGTCTACCCTTCTGGCTGCCATGGTCCCCATTGAGA 2920 
TCTTTGGTGACTCACCTCAGCCACCAAGCCAGGCCTCTGC 2960 
CTTCCTTCAGCTCTAAGGGCATGAAGGGTGTGGACAGAGC 3000 
3010 3020 3030 3040 

AGCCACAGGCTGCCCACAGTCACCCACATGCAAGTGTTAT 3040 
TTCCTTGTTTGTTTTAAAAAAATAAACATGCTGAGCCTTG 3080 
AAAAAAAAAAAAAAAAAA 3098 
mFATP1 full length.protein 



10 20 30 40 


MRTPGAGTASVASLGLLWLLGLPWTWSAAAAFGVYVGSGG 40 
WRFLRIVCKTARRDLFGLSVLIRVRLELRRHRRAGDTIPR 80 
IFQAVAQRQPERLALVDASSGICWTFAQLDTYSNAVANLF 120 
LQLGFAPGDVVAVFLEGRPEFVGLWLGLAKAGVVAALLNV 160 
NLRREPLAFCLGTSAAKALIYGGEMAAAVAEVSEQLGKSL 200 

210 220 230 240 


LKFCSGDLGPESVLPDTQLLDPMLAEAPTTPLAQAPGKGM 240 
DDRLFYIYTSGTTGLPKAAIVVHSRYYRIAAFGHHSYSMR 280 
ANDVLYDCLPLYHSAGNIMGVGQCIIYGLTVVLRKKFSAS 320 
RFWDDCVKYNCTVVQYIGEICRYLLRQPVRDVERRHRVRL 360 
AVGNGLRPAIWEEFTQGFGVRQIGEFYGATECNCSIANMD 400 

410 420 430 440 


GKVGSCGFNSRILTHVYPIRLVKVNEDTMEPLRDSQGLCI 440 
PCQPGEPGLLVGQINQQDPLRRFDGYVSDSATNKKIAHSV 480 
FRKGDSAYLSGDVLVMDELGYMYFRDRSGDTFRWRGENVS 520 
TTEVEAVLSRLLGQTDVAVYGVAVPGVEGKSGMAAIADPH 560 
NQLDPNSMYQELQKVLASYAQPIFLRLLPQVDTTGTFKIQ 600 

610 620 630 640 


KTRLQREGFDPRQTSDRLFFLDLKQGRYLPLDERVHARIC 640 
AGDFSL. 647 

F. 3 . 57 
mVLACS(FATP2) full length.DNA 



10 20 30 40 


GACACAGTACTGCCGATGTTGGACAGAGGATCGCTTAACA 40 
GAACGAAATCTC AAAACAAATTAACAGGACCCGGTTGCTT 80 
GATTTCCCAAATCAGAAAAGGCTCGAAATGTCTAGAGGGG 120 
CTGACTGATGCAGCGGTGACCCGGACTGGAGACAGTTGGA 160 
CGCGATCATCTCTGGTGCTTTTGTTCAACC TTGAAACCTT 200 

210 220 230 240 


CGCCACAGGAGACTTGCCTGAGCAGAGAAGCAAACGTGGA 240 
GAAACAAAGAGAGATCTAGCGAAAAGCCTCTGGGACCAAG 280 
GAGGGGAGGTGGGACTCTGGGTTGGCGGTGGCACCTGCTG 320 
CCGGCTATTAATAATAGGGTCGCGATGCGTTTATAAGGTG 360 
TTTGATTAAACAAAGACTCTATGAGAGAAGAATAACTAGC 400 

410 420 430 440 


AACAGCCCCACGTCTGAGTCGTCGCCTCCGACCTTTTTCA 440 
ACGTGGGTTCTTTGGGCCGAGCGTCGTTTGCCGAGAACTA 480 
GATCTCACCTGACCCCAGACGCTGAAAACAAGCGCTGTGG 520 
CATCCTGGGCCACCCAAGC TGACAAGGGCGCGCCCCCTGA 560 
GCACACGAGGTGCCCCACGAGGGGGAGGGACCCACAGCCG 600 

610 620 630 640 


TCCCGCCCGCACCGCGGTGTCCGCTGCGGGCACCTGCAGC 640 
CGAGCCGCCACCCGCAGTCGCAGCGCGTCCGGCGGCCGAA 680 
CCCGGTCGTCAGCTCGTCAGCACCTGCTCTGCTTCTCTCC 720 
CGCCCGCCGCCGCGCTGCACGCCTCGAGCGCTCCCTCGGC 760 
CCCGGCGGGGACCGGGGACCCCGCAGCCACCGCC ATGCTG 800 

810 820 830 840 


CCTGTGCTCTACACCGGCCTGGCGGGGCTGCTGCTGCTGC 840 
CTCTGCTGCTCACCTGCTGCTGCCCCTACC TCCTCCAGGA 880 
CGTGCGGTTCT TCCTGCAACTGGCCAACATGGCCCGGCAG 920 
GTGCGCAGCTACCGGCAGCGGCGACCCGTGCGCACCATCC 960 
TGCATGTCTTCTTGGAGCAAGCGCGCAAGACCCCGCACAA 1000 

1010 1020 1030 1040 


GCCCTTCCTGCTGTTTCGCGACGAGACGCTTACCTACGCC 1040 
CAGGTAGACCGGCGCAGCAACCAAGTAGCGCGAGCGCTGC 1080 
ATGATCACCTGGGCCTGCGGCAGGGGGATTGCGTGGCCCT 1120 
CTTCATGGGCAATGAGCCGGCCTACGTGTGGCTCTGGCTG 1160 
GGACTGCTCAAACTGGGCTGTCCCATGGCGTGCCTCAACT 1200 



fry S2h 
mVLACS(FATP2) full length.DNA 

1210 1220 1230 1240 


ACAACATCCGTGCCAAGTCTCTGCTACACTGCTTTCAGTG 1240 
CTGCGGGGCGAAGGTGCTGCTGGCCTCCCCAGAGCTACAC 1280 
GAAGCTGTCGAGGAGGTTCTTCCAACCCTGAAAAAGGAGG 1320 
GCGTGTCCGTCTTCTACGTAAGCAGAAC TTCTAACACTAA 1360 
TGGCGTGGACACAGTACTGGACAAAGTAGACGGGGTGTCG 1400 

1410 1420 1430 1440 


GCGGACCCCATCCCGGAGTCGTGGAGGTCTGAAGTCACGT 1440 
TCACCACACCCGCAGTCTACATATATACTTCGGGCACCAC 1480 
AGGTCTTCCAAAGGCTGCAACCATTAATCACCATCGCCTC 1520 
TGGTATGGGACCAGCCTTGCCCTGAGGTCCGGAATTAAGG 1560 
CTCATGACGTCATCTACACCACCATGCCCCTGTACCACAG 1600 

1610 1620 1630 1640 


CGCGGCGCTCATGATTGGCCTCCACGGATGCATTGTGGTT 1640 
GGGGCTACATTTGCTTTGCGGAGCAAATTTTCAGCCAGCC 1680 
AGTTTTGGGACGACTGCAGGAAATACAACGCCACTGTCAT 1720 
TCAGTACATCGGTGAACTGCTTCGGTACCTCTGCAACACG 1760 
CCCCAGAAACCAAATGACCGGGACCACAAAGTGAAAATAG 1800 

1810 1820 1830 1840 


CACTAGGAAATGGCTTACGAGGAGATGTGTGGAGAGAGTT 1840 
CATCAAGAGATTTGGGGACATTCACATTTATGAGTTCTAC 1880 
GCTTCCACTGAAGGCAACATTGGATTTATGAACTATCCAA 1920 
GAAAAATCGGAGCTGTTGGAAGAGAAAATTACCTACAAAA 1960 
AAAAGTTGTAAGGCACGAGCTGATC AAGTATGACGTGGAG 2000 

2010 2020 2030 2040 


AAGGATGAGCCTGTCCGTGATGCAAATGGATATTGC ATCA 2040 
AAGTCCCCAAAGGAGAGGTTGGACTCTTGATTTGCAAAAT 2080 
CACAGAGCTCACACCATTTTTTGGCTATGCTGGAGGAAAG 2120 
ACCCAGACAGAGAAGAAAAAGCTCAGAGATGTTTTTAAGA 2160 
AAGGAGACGTCTACTTCAACAGTGGCGATCTCCTGATGAT 2200 

2210 2220 2230 2240 


CGACCGTGAAAATTTCATCTATTTTCACGACAGAGTTGGA 2240 
GACACCTTCCGGTGGAAAGGAGAGAATGTAGCTACCACGG 2280 
AAGTCGCTGACATTGTGGGACTGGTAGATTTTGTTGAAGA 2320 
AGTGAATGTTTACGGTGTGCCCGTGCC AGGTCATGAAGGT 2360 
CGCATCGGGATGGCCTCGATCAAGATGAAAGAAAACTACG 2400 

2410 2420 2430 2440 


AGTTCAATGGAAAGAAACTCTTTC AGCACATCTCGGAGTA 2440 
CCTGCCCAGTTACTCGAGGCCTCGGTTCCTGAGAATACAA 2480 
GATACCATTGAGATCACCGGGAC TTTTAAACACCGCAAAG 2520 
TGACCCTGATGGAAGAGGGCTTTAACCCCTCAGTCATCAA 2560 
AGATACCTTGTATTTCATGGATGAC ACAGAAAAAACATAC 2600 
mVLACS(FATP2) full length.DNA 

2610 2620 2630 2640 


GTGCCCATGACTGAGGACATTTATAATGCCATAATTGATA 2640 
AGACTCTGAAGCTCTGAATGTTGCCTGGCTCCTAACACTT 2680 
CCAGAAAGAAACACAATAGGCCTAGCATAGCCCCTTCACA 2720 
TGTGTAATCCAACTTTAACTTGATTAAAGGTTATAGGTGT 2760 
GATTTTTCCTAGGAAATTATTCATTTAAAGGACAATTGTT 2800 
2810 2820 2830 2840 


TGTTTGTTTGTTTGTTTTTTATTAATTACACCAGAACGTT 2840 
TGCAAGTAAAAAGATTTAAAGTCACTTATTTTTCAATGTG 2880 
CACCTGCCATTTGTCCTTGCAAACTTAGCTTCTTGGAGAG 2920 
AGGGCCTTATTTTTTTAAAGACATAATAAACTATGTAAAC 2960 
ACT 2963 
mVLACS(FATP2) full length.prot 



10 20 30 40 


MLPVLYTGLAGLLLLPLLLTCCCPYLLQDVRFFLQLANMA 40 
RQVRSYRQRRPVRTILHVFLEQARKTPHKPFLLFRDETLT 80 
YAQVDRRSNQVARALHDHLGLRQGDCVALFMGNEPAYVWL 120 
WLGLLKLGCPMACLNYNIRAKSLLHCFQCCGAKVLLASPE 160 
LHEAVEEVLPTLKKEGVSVFYVSRTSNTNGVDTVLDKVDG 200 

210 220 230 240 

VSADPIPESWRSEVTFTTPAVYIYTSGTTGLPKAATINHH 240 
RLWYGTSLALRSGIKAHDVIYTTMPLYHSAALMIGLHGCI 280 
VVGATFALRSKFSASQFWDDCRKYNATVIQYIGELLRYLC 320 
NTPQKPNDRDHKVKIALGNGLRGDVWREFIKRFGDIHIYE 360 
FYASTEGNIGFMNYPRKIGAVGRENYLQKKVVRHELIKYD 400 

410 420 430 440 


VEKDEPVRDANGYCIKVPKGEVGLLICKITELTPFFGYAG 440 
GKTQTEKKKLRDVFKKGDVYFNSGDLLMIDRENFIYFHDR 480 
VGDTFRWKGENVATTEVADIVGLVDFVEEVNVYGVPVPGH 520 
EGRIGMASIKMKENYEFNGKKLFQHISEYLPSYSRPRFLR 560 
IQDTIEITGTFKHRKVTLMEEGFNPSVIKDTLYFMDDTEK 600 

610 620 630 640 


TYVPMTEDIYNAIIDKTLKL. 621 
mFATP4 partial.DNA 



10 20 30 40 


GATCAGCTCTTCTATATCTACACGTCGGGCACCACGGGGC 40 
TACCCAAAGCTGCCATTGTGGTGCACAGCAGGTATTACCG 80 
AATGGCTGCCCTGGTGTACTATGGATTCCGCATGCGGCCT 120 
GATGACATTGTCTATGACTGCCTCCCCCTCTACCACTCAG 160 
CAGGAAACATTGTGGGGATTGGCCAGTGCGTACTCCACGG 200 

210 220 230 240 


CATGAC TGTGGTGATCCGGAAGAAGTTTTCAGCCTCCCGG 240 
TTCTGGGATGACTGTATCAAGTACAACTGCACAATTGTAC 280 
AGTACATTGGTGAGCTTTGCCGCTACCTCCTGAACCAGCC 320 
ACCCCGTGAGGCTGAGTCTCGGCACAAGGTGCGCATGGCA 360 
CTGGGCAACGGTC TCCGGCAGTCCATCTGGACCGACTTCT 400 

410 420 430 440 


CCAGCCGTTTCCACATTCCCAAGGTGGCCGAGTTCTACGG 440 
GGCCACCGAGTGCAACTGTAGCTTGGGC AACTTTGACAGC 480 
CAGGTGGGGGCCTGTGGCTTCAATAGCCGCATCCTGTCCT 520 
TTGTGTACCCCATCCGCTTGGTACGAGTCAATGAGGATAC 560 
CATGGAACTGATCCGGGGACCCGATGGCGTCTGCATTCCC 600 

610 620 630 640 


TGTCAACCAGGCCAGCC AGGCC AGCTGGTGGGTCGCATCA 640 
TCCAGCAGGACCCCCTACGCCGTTTTGATGGCTACCTCAA 680 
CCAGGGTGCCAACAACAAGAAGATTGCTAGTGATGTCTTC 720 
AAGAAAGGGGACCAAGCCTACCTC ACTGGTGACGTGCTGG 760 
TGATGGATGAGCTGGGC TACCTGTAC TTCCGAGACCGCAC 800 

810 820 830 840 


AGGGGACACGTTCCGCTGGAAAGGGGAGAATGTGTCTACC 840 
ACTGAAGTGGAGGGCAC ACTC AGCCGCCTGCTTCAGATGG 880 
CAGATGTGGCTGTTTATGGTGTTGAGGTGCCAGGAGCTGA 920 
GGGCCGAGCAGGAATGGCTGCTGTGGCAAGCCCCAC TAGC 960 
AACTGTGACCTGGAGAGCTTTGC ACAGACCTTGAAAAAGG 1000 

1010 1020 1030 1040 


AGCTGCCCCTGTACGCCCGCCCCATCTTCCTCCGCTTCTT 1040 
GCCTGAGCTGCACAAAACAGGAACCTTCAAGTTCCAGAAG 1080 
ACAGAGTTGCGGAAGGAGGGCTTTGACCCGTCTGTTGTGA 1120 
AAGACCCACTCTTCTATTTGGATGCCCGGACAGGCTGCTA 1160 
TGTTGCACTGGACCAAGAGGCCTATACCCGCATCCAGGCA 1200 



fey 60 A 
mFATP4 partial.DNA 

1210 1220 1230 1240 


GGCGAGGAGAAGCTGTGATTTCCCCCACATCCCTCTGAGG 1240 
GCCAGAGGATGCTGGATTCAGAGCCCCAGCTTCCACTCCA 1280 
GAAGGGGTCTGGGCAAGGCCAGACCAAAGCTAGCAGGGCC 1320 
CGCACCTTCACCCTAGGTGCTGATCCCCCT 1350 
mFATP4 partial.protein 



10 20 30 40 



DQLFYIYTSGTTGLPKAAIVVHSRYYRMAALVYYGFRMRP 40 
DDIVYDCLPLYHSAGNIVGIGQCVLHGMTVVIRKKFSASR 80 
FWDDCIKYNCTIVQYIGELCRYLLNQPPREAESRHKVRMA 120 
LGNGLRQSIWTDFSSRFHIPKVAEFYGATECNCSLGNFDS 160 
QVGACGFNSRILSFVYPIRLVRVNEDTMELIRGPDGVCIP 200 
210 220 230 240 



CQPGQPGQLVGRIIQQDPLRRFDGYLNQGANNKKIASDVF 240 
KKGDQAYLTGDVLVMDELGYLYFRDRTGDTFRWKGENVST 280 
TEVEGTLSRLLQMADVAVYGVEVPGAEGRAGMAAVASPTS 320 
NCDLESFAQTLKKELPLYARPIFLRFLPELHKTGTFKFQK 360 
TELRKEGFDPSVVKDPLFYLDARTGCYVALDQEAYTRIQA 400 

410 420 430 440 

GEEKL. 406 
mmFATP1 full length.DNA 



10 20 30 40 


ATGCGGGCTCCTGGAGCAGGAAC AGCCTCTGTGGCCTCAC 40 
TGGCGCTGCTTTGGTTTCTGGGACTTCCGTGGACCTGGAG 80 
CGCGGCGGCGGCGTTCTGTGTGTACGTGGGTGGCGGCGGC 120 
TGGCGCTTTCTGCGTATCGTCTGCAAGACGGCGAGGCGAG 160 
ACCTCTTTGGCCTCTCTGTTCTGATTCGTGTTCGGCTAGA 200 

210 220 230 240 


GC TGCGACGAC ACCGGCGAGCAGGAGACACGATCCCGTGC 240 
ATCTTCCAGGC TGTGGCCCGGCGACAACCAGAGCGCCTGG 280 
CACTGGTGGACGCCAGTAGTGGTATATGCTGGACCTTCGC 320 
ACAGCTGGACACCTACTCCAATGCTGTAGCCAACCTGTTC 360 
CGCCAGCTGGGCTTTGCACCAGGCGATGTGGTGGCTGTGT 400 

410 420 430 440 


TCCTGGAGGGCCGGCCGGAGTTCGTGGGACTGTGGCTGGG 440 
CCTGGCCAAGGCCGGTGTGGTGGCTGCTCTTCTCAATGTC 480 
AACCTGAGGCGGGAGCCCCTGGCCTTCTGCCTGGGCACAT 520 
CAGCTGCCAAGGCCCTCATTTATGGCGGGGAGATGGCAGC 560 
GGCGGTGGCGGAGGTGAGCGAGCAGCTGGGGAAGAGCCTC 600 

610 620 630 640 


CTCAAGTTCTGCTCTGGAGATCTGGGGCCTGAGAGCATCC 640 
TGCCTGACACGC AGCTCCTGGACCCCATGCTTGCTGAGGC 680 
GCCCACCACACCCCTGGCACAAGCCCCAGGCAAGGGCATG 720 
GATGATCGGCTGTTTTACATCTATACTTCTGGGACCACCG 760 
GGCTTCC TAAGGCTGCCATTGTGGTGC ACAGCAGGTACTA 800 

810 820 830 840 


CCGCATTGCTGCCTTTGGCCACCATTCCTACAGCATGCGT 840 
GCCGCCGATGTGCTCTATGACTGCCTGCCACTCTACCACT 880 
CTGCAGGGAACATC ATGGGTGTGGGGC AGTGCGTCATCTA 920 
CGGGTTGACGGTGGTACTGCGCAAGAAGTTCTCCGCCAGC 960 
CGCTTCTGGGATGACTGTGTCAAGTACAATTGCACGGTAG 1000 

1010 1020 1030 1040 


TGGATGACATAGGTGAAATCTGCCGCTACCTGCTGAGGCA 1040 
GCCGGTTCGCGACGTGGAGCAGCGACACCGCGTGCGCCTG 1080 
GCCGTGGGTAATGGGCTGCGGCCAGCCATCTGGGAGGAGT 1120 
TCACGCAGCGCTTCGGTGTGCCACAGATCGGCGAGTTCTA 1160 
CGGCGCTACCGAGTGCAACTGCAGCATTGCCAACATGGAC 1200 

f{y G2 A 
mmFATP1 full length.DNA 

1210 1220 1230 1240 


GGCAAGGTCGGCTCCTGCGGCTTCAACAGCCGTATCCTCA 1240 
CGCATGTGTACCCCATCCGTCTGGTCAAGGTCAATGAGGA 1280 
CACGATGGAGCCACTGCGGGACTCCGAGGGCCTCTGCATC 1320 
CCGTGCCAGCCCGGGGAACCCGGCCTTCTCGTGGGCCAGA 1360 
TCAACCAGCAGGACCC TCTGCGGCGTTTCGATGGTTATGT 1400 

1410 1420 1430 1440 


TAGTGACAGTGCCACCAACAAGAAGATTGCCCACAGCGTT 1440 
TTCCGAAAGGGCGATAGCGCCTACCTCTCAGGTGACGTGC 1480 
TAGTGATGGACGAGCTGGGCTACATGTATTTCCGTGACCG 1520 
CAGCGGGGACACCTTCCGCTGGCGCGGGGAGAACGTGTCC 1560 
ACCACGGAGGTGGAAGCCGTGCTGAGCCGCCTACTGGGCC 1600 

1610 1620 1630 1640 

AGACGGACGTGGCTGTGTATGGGGTGGCTGTGCCAGGAGT 1640 
GGAGGGGAAAGCTGGCATGGCAGCCATCGCAGATCCCCAC 1680 
AGCCAGTTGGACCCTAACTC AATGTACCAGGAATTACAGA 1720 
AGGTTCTTGCATCCTATGCTCGGCCCATCTTCCTGCGTCT 1760 
TCTGCCCCAGGTGGATACCACAGGCACCTTCAAGATCCAG 1800 

1810 1820 1830 1840 


AAGACCCGGCTGCAGCGTGAAGGCTTTGACCCCCGTCAGA 1840 
CCTCAGACAGGCTCTTCTTTCTAGACCTGAAGTCCGGCAC 1880 
GAGGTATCTACCCCTGGATGAGAGAGTCCATGCCCGCATT 1920 
TGCGCAGGCGACTTCTCACTCTGAGCCTGGAGAGTGGGCT 1960 
GGGCCTGGACTCCTGAGACCTGGGAGCCTGACACCCCTCT 2000 

2010 2020 2030 2040 


TCGGGTGCTTCTCCTGCCTGGCCACATGGACAGCAGCACC 2040 
TGTGAGAGTAGGAAAATGGAACCTGAGTGGCTGGGACCCC 2080 
TCTCCTACTTCCCACTATGCATCCATTTTGCC TCTGCCTT 2120 
GATCTTTTTCTCCATCTCTTTTCTCCCTACCCAGCAGGAG 2160 
CCCCACAAACACATGTTGGCTGCTGTGTCCTGCAGTTGGA 2200 

2210 2220 2230 2240 


CCAGTGTCCAGGGGTAC AGGCTTCAGGCTGTGACCCACAC 2240 
TGGTACCCACCTCCCTTTCC TATTTTGCCTTAGGTTCATC 2280 
CACGGTTCCCCTGTGGAGCAAGTGGGGGCCCACATAGCTG 2320 
CTGTCCCTGCTGAGGGTTGGTAGCAATCACACCCTCATGT 2360 
CAGCTGGGAGACACGCGCAGTCTCCCAC TGACCCCCAATC 2400 

2410 2420 2430 2440 


AACTGAAAATATTGTTTTGACTACTTTTTGTTTTTTTGTT 2440 
TTTTTGTTTTTTTTTTTTTTCGAGACAGAGTTTCTCTGTA 2480 
TAGCCCTGGCTGTCCTGGAACTC AC TTTGTAGACCAGGCT 2520 
GGCCTCGAACTCAAAAATCC TCCTGACTCTGCCTCTGCTT 2560 
CCCAAGTGCTGGGATTAAAGACGTGCGCCACCACCGCCTG 2600 
mmFATP1 full length.DNA 



2610 2620 2630 2640 


GCTGTTTTGTATTTTTGTTTTGTTTTGACGATAGGGTCTC 2640 
ACTGTGGAGGCCAAGCTGGCCTCAGACTCCCCACCCCATT 2680 
GCCTCTGGGCACCATTCTATATTCTCAGACTGATGACAAT 2720 
GCACTAGTGTCCCTAGGAGTCTTGAGTCTGCACTTTCCCC 2760 
TCATAGCCTCAAGCTTCCAGAACTGACTCTGATCACTTGG 2800 

2810 2820 2830 2840 


ATGTGGCTAGTGTTGGCTCTACCCACATGTGTCAATTCAG 2840 
GGGTCCCCAGGCATAGTCTCTGGAAGCCCTCACCCGGAAA 2880 
AAGCTTGGAGAGACCCAGGAAGGTTGTTGTGTTCTCTTGG 2920 
GCACCCCCTGGTGGCAGTCCTGGGCATGCTTCCGCACTGT 2960 
ACTGGTGCATATAGCCCAGACCTATGACATTTGAGGTCTA 3000 

3010 3020 3030 3040 


CCCTTCTGGCTCCTGTGGTCCCCATTGAGATCCTTGGTGA 3040 
CTCACCTCAGTCACCAAGCAGAGCCTCTGCCTGCCTTCAT 3080 
CTTCAAGGTCATGAAGGATGTGGACAGAGCAGCTACAGGC 3120 
TGCCAGCAGTCAACCACATGAGAGTGTTACTTCCTTGTTG 3160 
GTTTTTAAAAAATAAATGTGCTGAGCCTCGAAAAAAAAAA 3200 

3210 3220 3230 3240 


AAAAAAAAAAAAAAAAA 3217 
mmFATP1 full length.protein 



10 20 30 40 



MRAPGAGTASVASLALLWFLGLPWT^ 40 
urfi R 1 VCKTARROLFGLSVL RVRLELRRHRRAGUT I rL ou 
TfOAVARRQPERLALVOASSG I CWTFAQLDT YSNAVANLF 20 
ROLGFAPGOVVAVFLEGRPEFVGUWLGLAKAGVVAALLNV 160 

nlrrep[afclgtsaakaliygge M aaavaevseqlgksl200 

210 220 230 240 



• i/irrSGDLGPES l LPDTQLL0PML.AEAPTTPLAQAPGKGI1 240 
nnoPcv I YT^rTTGLPKAAI VVHSRYYR 1 AAFGHHSYSMR 280 
A4nvf YnCLPLYHSAGN I MGVGQCV I YGLTVVLRKKFSAS 320 

rf2o^vkync^voo?geicryllrqpvrdveqrhrvrl .360 
avgSglrpIiweeftqrfgvpqigefygatecncsianmd 400 

410 420 430 440 



GKVGSCGFNSR I LTHVYP I ^Q^p^y yllll a TMK^iAIH^i 480 
prnprFPGLLVGQ I NOQDPLRRFOGYVSDSATNKK I AH5V HBU 
FRKGDSAYLSGOVLVMDELGYMYFRORSGOTFRWRGENVS 520 
t?f v?!vLSRLLGOTOV AVYGVAVPGVEGKAGMAA I AOPH 560 
SQLOPNSMYQELQKVLASYARP . FLRLLPO VDTTGTFK . 0 600 

610 620 630 640 

KTRLOREGFipRoisORLFFLDLKSGiRYLPLDERVHARl 640 
CAGOFSL 647 



mmFATP2 full length.DNA 



10 20 30 40 


GGGCGGAGGCCGAGCCCAGTCGCCAGCTCCTGCTCTGCTC 40 
CTCTCCCGCCTGCCGCCGCGCTGCAGGCCTCGAGCACTCC 80 
CTCGGCCCCGGCGGGGACCGGGGACCCCGCAGCTACCGCC 120 
ATGCTGCCAGTGCTCTACACCGGCCTGGCGGGGCTGCTGC 160 
TGCTGCCTCTGCTGCTCACCTGCTGCTGCCCCTACCTCCT 200 

210 220 230 240 




CCAAGATGTGCGGTACTTCCTGCGGCTGGCCAACATGGCC 240 
CGGCGGGTGCGCAGCTACCGGCAGCGGCGACCCGTGCGTA 280 
CCATCCTGCGGGCC TTCCTGGAACAAGCGCGCAAGACCCC 320 
ACACAAGCCCTTCCTGCTGTTCCGAGACGAGACGCTCACC 360 
TACGCCCAGGTGGACCGGCGCAGCAACCAAGTGGCGCGGG 400 

410 420 430 440 




CGCTGCACGATCAACTGGGCCTACGACAGGGGGATTGCGT 440 
AGCCCTCTTCATGGGCAATGAGCCGGCCTACGTGTGGATC 480 
TGGCTGGGACTGCTCAAACTGGGCTGTCCC ATGGCGTGCC 520 
TCAACTACAAC ATTCGTGCC AAGTCTCTGCTGCACTGCTT 560 
TCAATGCTGCGGGGCGAAGGTGCTGCTGGCCTCCCCAGAT 600 

610 620 630 640 




CTACAAGAAGCTGTGGAGGAGGTTCTTCCAACCCTGAAAA 640 
AGGATGCCGTGTCCGTCTTTTACGTAAGCAGAACTTCTAA 680 
CACAAATGGTGTGGACACAATACTGGACAAAGTAGACGGA 720 
GTGTCGGCGGAACCCACCCCGGAGTCGTGGAGGTC TGAAG 760 
TCACTTTTACCACGCCAGC AGTATACATTTATACTTCGGG 800 

810 820 830 840 




AACCACAGGTC TTCCAAAAAGCGGAACCATCAATCATCAT 840 
CGCCTAAGGTATGGGACAAGCCTTGCTATGTCGAGTGGGA 880 
ATCACGGCCAAGGATGTCATC TATACCAACAATGCCCCTG 920 
TTCCAACAGTGCAACGCTCAAGATCGGCC TTCACGGATGC 960 
ATCCTGGGTTGGGGCTACTTTAACCTTGGCGGGGC AAATT 1000 

1010 1020 1030 1040 




CTCAAGCAAGCCAATTTTGGGAACGACTGGCAGGAAATAC 1040 
AACGTCAACGGTCATTCAGTACATTGGTGAACTGCTTCGG 1080 
TACCTGTGCAACACACCGCAGAAACCAAATGACCGGGACC 1120 
ACAAAGTGAAAAAAGCCCTGGGAAATGGCTTACGAGGAGA 1160 
TGTGTGGAGAGAGTTCATCAAGAGATTTGGGGACATCCAC 1200 

Kj. OH A 
mmFATP2 full length.DNA 



1210 1220 1230 1240 



GTGTATGAGTTCTACGCATCC ACTGAAGGC AACATTGGAT 1240 
TTGTGAACTATCCAAGGAAAATCGGTGCTGTCGGGAGAGC 1280 
AAACTACCTAC AAAGAAAAGTTGCAAGGTATGAGCTGATC 1320 
AAGTATGACGTGGAGAAGGACGAGCCGGTCCGTGACGCAA 1360 
ATGGATATTGCATCAAAGTCCCCAAAGGTGAGGTTGGACT 1400 

1410 1420 1430 1440 


CTTGGTTTGCAAAATCACACAGCTCACACCATTTATTGGC 1440 
TATGCTGGAGGAAAGACCC AGACAGAGAAGAAAAAACTCA 1480 
GAGATGTCTTTAAGAAAGGCGACATCTACTTCAAC AGCGG 1520 
AGACCTCCTGATGATCGACCGTGAGAACTTCGTCTACTTT 1560 
CACGACAGGGTTGGAGATACTTTCCGGTGGAAAGGAGAGA 1600 

1610 1620 1630 1640 




ACGTAGCTACCAC AGAAGTCGCTGACATCGTGGGACTGGT 1640 
AGATTTTGTTGAAGAAGTGAATGTGTATGGCGTGCCTGTG 1680 
CCAGGTCATGAGGGTCGAATTGGGATGGCCTCCCTCAAGA 1720 
TCAAAGAAAAC TACGAGTTC AATGGAAAGAAACTCTTTCA 1760 
ACACATCGCGGAGTACCTGCCCAGTTACGCGAGGCCTCGG 1800 

1810 1820 1830 1840 



TTCCTGAGGATACAAGATACCATTGAGATCACTGGGACTT 1840 
TTAAACACCGCAAAGTGACCCTGATGGAAGAGGGCTTCAA 1880 
TCCCACAGTCATCAAAGATACCTTGTATTTC ATGGATGAT 1920 
GCAGAGAAAACATTTGTGCCCATGACTGAGAACATTTATA 1960 
ATGCCATAATTGATAAAACTCTGAAGCTCTGAATATTCCC 2000 

2010 2020 2030 2040 


TGGTGGTTTAGCTCATGACATTTCCAGAAAGAAACTCGAT 2040 
AGACCTCGCAGAGCCACTTCATACGTAG AATCCAACTTTA 2080 
ACTTGATTGAAGACTATAAGGTGCGATTTTATTTTTAGGA 2120 
AATTATTCATTAAAAGGATAGTTTTTTTTTTTTTTTTTAA 2160 
TTACACCTGAACCTTTGCAAGTAAAAAGATTTAGAGACAA 2200 

2210 2220 2230 2240 


TTATTTTTCAATGTGCACCTGCCATTTGTCCTTGCAAACT 2240 
AAGCTTCTTGGAGAGAGGGCCTTATTTTTTTAAAGACATA 2280 
ATAAACTATATTAAC ACTAAAAAAAAAAAAAAAAAAAAAA 2320 
AAAAAAAAAAAAAAAAAA 2338 



mmFATP2 full length.protein 



10 20 30 40 


MLPVLYTGLAGLLLLPLLLTCCCPYLLQDVRYFLRLANMA 40 
RRVRSYRQRRPVRTILRAFLEQARKTPHKPFLLFRDETLT 80 
YAQVDRRSNQVARALHDQLGLRQGDCVALFMGNEPAYVWI 120 
WLGLLKLGCPMACLNYNIRAKSLLHCFQCCGAKVLLASPD 160 
LQEAVEEVLPTLKKDAVSVFYVSRTSNTNGVDTILDKVDG 200 

210 220 230 240 


VSAEPTPESWRSEVTFTTPAVYIYTSGTTGLPKSGTINHH 240 
RLRYGTSLAMSSGNHGQGCHLYQQCPCSNSATLKIGLHGC 280 
ILGWGYFNLGGANSQASQFWERLAGNTTSTVIQYIGELLR 320 
YLCNTPQKPNDRDHKVKKALGNGLRGDVWREFIKRFGDIH 360 
VYEFYASTEGNIGFVNYPRKIGAVGRANYLQRKVARYELI 400 

410 420 430 440 


KYDVEKDEPVRDANGYCIKVPKGEVGLLVCKITQLTPFIG 440 
YAGGKTQTEKKKLRDVFKKGDIYFNSGDLLMIDRENFVYF 480 
HDRVGDTFRWKGENVATTEVADIVGLVDFVEEVNVYGVPV 520 
PGHEGRIGMASLKIKENYEFNGKKLFQHIAEYLPSYARPR 560 
FLRIQDTIEITGTFKHRKVTLMEEGFNPTVIKDTLYFMDD 600 

610 620 630 640 


AEKTFVPMTENIYNAIIDKTLKL. 624 



fro. CS 
mmFATP3 partial.DNA 



10 20 30 40 




GAAAGCTCTGAGAGCGGGTGCAGTCTGGCCTGGCGTCTCG 40 
CGTACCTGGCCCGGG AGCAGCCGACACACACCTTCCTCAT 80 
CCACGGCGCGCAGCGCTTTAGCTACGCGGAGGCTGAGCGC 120 
GAGAGCAACCGGATTGCTCGCGCCTTTCTGCGCGCACGGG 160 
GCTGGACCGGGGGCCGCCGAGGCTCGGGCAGGGGCAGCAC 200 

210 220 230 240 




TGAGGAAGGCGCACGCGTGGCGCCTCCGGCTGGAGATGCG 240 
GCTGCTAGAGGGACGACCGCGCCCCCTCTGGCACCCGGGG 280 
CGACCGTGGCGCTGCTCCTCCCAGCGGGCCCGGATTTCCT 320 
TTGGATTTGGTTCGGACTGGCCAAAGCTGGCCTGCGCACG 360 
GCCTTTGTGCCCACCGCTTTACGCCGAGGACCCCTGCTGC 400 

410 420 430 440 


ACTGCCTCCGCAGCTGCGGTGCGAGTGCGCTCGTGCTGGC 440 
CACAGAGTTCCTGGAGTCCCTGGAGCCGGACCTGCCGGCC 480 
TTGAGAGCCATGGGGC TCCACCTATGGGCGACGGGCCCTG 520 
AAACTAATGTAGCTGGAATCAGCAATTTGCTATCGGAAGC 560 
AGCAGACCAAGTGGATGAGCCAGTGCCGGGGTACCTCTCT 600 

610 620 630 640 




GCCCCCCAGAACATAATGGACACCTGCCTGTACATCTTCA 640 
CCTCTGGCACTACTGGCCTGCCC AAGGCTGCTCGAATCAG 680 
TCATCTGAAGGTTCTACAGTGCCAGGGATTCTACCATCTG 720 
TGTGGAGTCCACCAGGAGGACGTGATCTACCTCGCACTCC 760 
CACTGTACC ACATGTCTGGCTCCCTTCTGGGC ATTGTGGG 800 

810 820 830 840 


CTGCTTGGGCATTGGGGCCACCGTGGTGCTGAAACCCAAG 840 
TTCTCAGCTAGCCAGTTCTGGGACGATTGCCAGAAACACA 880 
GGGTGACAGTGTTCCAGTACATTGGGGAGTTGTGCCGATA 920 
CCTCGTCAACCAGCCCCCGAGCAAGGCAGAGTTTGACCAT 960 
AAGGTGCGCTTGGC AGTGGGCAGTGGGTTGCGCCCAGACA 1000 

1010 1020 1030 1040 




CCTGGGAGCGTTTCCTGCGGCGATTTGGACCTCTGCAGAT 1040 
ACTGGAGACGTATGGCATGACAGAGGGCAACGTAGCTACG 1080 
TTCAATTACACAGGACGGCAGGGTGCAGTGGGGCGAGCTT 1120 
CCTGGCTTTACAAGCACATCTTCCCCTTCTCCTTGATTCG 1160 
ATACGATGTCATGACAGGGGAGCCTATTCGGAATGCCCAG 1200 

fiy 66 A 



mmFATP3 partial.DNA 

1210 1220 1230 1240 


GGGCACTGCATGACCACATCTCCAGGTGAGCCAGGCCTAC 1240 
TGGTGGCCCCAGTGAGCCAGCAGTCCCCCTTCCTGGGCTA 1280 
TGCTGGGGCTCCGGAGCTGGCCAAGGACAAGCTGCTGAAG 1320 
GATGTCTTCTGGTCTGGGGACGTTTTCTTCAATACTGGGG 1360 
ACCTCTTGGTCTGTGATGAGCAAGGCTTTCTTCACTTCCA 1400 

1410 1420 1430 1440 


CGATCGTACTGGAGACACCATCAGGTGGAAGGGAGAGAAT 1440 
GTGGCCACAACTGAAGTGGCTGAGGTCTTGGAGACCCTGG 1480 
ACTTCCTTCAGGAGGTGAACATCTATGGAGTCACGGTGCC 1520 
AGGGCACGAAGGCAGGGCAGGCATGGCGGCCTTGGCTCTG 1560 
CGGCCCCCGCAGGCTCTGAACCTGGTGCAGCTCTACAGCC 1600 

1610 1620 1630 1640 


ATGTTTCTGAGAACTTGCCACCGTATGCCCGACCTCGGTT 1640 
TCTCAGGCTCCAGGAATCTTTGGCCACTACTGAGACCTTC 1680 
AAACAGCAGAAGGTTAGGATGGCCAATGAGGGCTTTGACC 1720 
CCAGTGTACTGTCTGACCCACTCTATGTTCTGGACCAAGA 1760 
TATAGGGGCCTACCTGCCCCTCACACCTGCCCGGTACAGT 1800 

1810 1820 1830 1840 


GCCCTCCTGTCTGGAGACCTTCGAATCTGAAACCTTCCAC 1840 
TTGAGGGAGGGGCTCGGAGGGTACAGGCCACCATGGCTGC 1880 
ACCAGGGAGGGTTTTCGGGTATCTTTTGTATATGGAGTCA 1920 
TTATTTTGTAATAAACAGCTGGAGCTTAAAAAAAAAAAAA 1960 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1998 



mmFATP3 partial.protein 



10 20 30 40 



ESSESGCSLAWRLAYLAREQPTHTFLIHGAQRFSYAEAER 40 
ESNRIARAFLRARGWTGGRRGSGRGSTEEGARVAPPAGDA 80 
AARGTTAPPLAPGATVALLLPAGPDFLWIWFGLAKAGLRT 120 
AFVPTALRRGPLLHCLRSCGASALVLATEFLESLEPDLPA 160 
LRAMGLHLWATGPETNVAGISNLLSEAADQVDEPVPGYLS 200 
210 220 230 240 






APQNIMDTCLYIFTSGTTGLPKAARISHLKVLQCQGFYHL 240 
CGVHQEDVIYLALPLYHMSGSLLGIVGCLGIGATVVLKPK 280 
FSASQFWDDCQKHRVTVFQYIGELCRYLVNQPPSKAEFDH 320 
KVRLAVGSGLRPDTWERFLRRFGPLQILETYGMTEGNVAT 360 
FNYTGRQGAVGRASWLYKHIFPFSLIRYDVMTGEPIRNAQ 400 

410 420 430 440 




GHCMTTSPGEPGLLVAPVSQQSPFLGYAGAPELAKDKLLK 440 
DVFWSGDVFFNTGDLLVCDEQGFLHFHDRTGDTIRWKGEN 480 
VATTEVAEVLETLDFLQEVNIYGVTVPGHEGRAGMAALAL 520 
RPPQALNLVQLYSHVSENLPPYARPRFLRLQESLATTETF 560 
KQQKVRMANEGFDPSVLSDPLYVLDQDIGAYLPLTPARYS 600 

610 620 630 640 



ALLSGDLRI. 610 



mmFATP4 full length.DNA 



10 20 30 40 




ATGCTGCTTGGAGCCTCTCTGGTGGGGGCGCTACTGTTCT 40 
CCAAGCTAGTGCTGAAGCTGCCCTGGACCCAGGTGGGATT 80 
CTCCCTGTTGCTCCTGTACTTGGGGTCTGGTGGCTGGCGT 120 
TTCATCCGGGTCTTCATCAAGACGGTCAGGAGAGATATCT 160 
TTGGTGGCATGGTGCTCCTGAAGGTGAAGACCAAGGTGCG 200 

210 220 230 240 




ACGGTACCTTCAGGAGCGGAAGACGGTGCCCCTGCTGTTT 240 
GCTTCAATGGTACAGCGCCACCCGGACAAGACAGCCCTGA 280 
TTTTCGAGGGCACAGACACTCACTGGACCTTCCGCCAGCT 320 
GGATGAGTACTCCAGTAGTGTGGCCAACTTCCTGCAGGCC 360 
CGGGGCCTGGCCTCAGGCAATGTAGTTGCCCTCTTTATGG 400 



410 420 430 440 




AAAACCGCAATGAGTTTGTGGGTCTGTGGCTAGGCATGGC 440 
CAAGCTGGGCGTGGAGGCGGCTCTCATCAACACCAACCTT 480 
AGGCGGGATGCCCTGCGCCACTGTCTTGACACCTCAAAGG 520 
CACGAGCTCTCATCTTTGGCAGTGAGATGGCCTCAGCTA I 560 
CTGTGAGATCCATGCTAGCCTGGAGCCCACACTCAGCCTC 600 

610 620 630 640 


TTCTGCTCTGGATCCTGGGAGCCCAGCACAGTGCCCGTCA 640 
GCACAGAGCATCTGGACCCTCTTCTGGAAGATGCCCCGAA 680 
GCACCTGCCCAGTCACCCAGACAAGGGTTTTACAGATAAG 720 
CTCTTCTACATCTACACATCGGGCACCACGGGGCTACCCA 760 
AAGCTGCCATTGTGGTGC ACAGCAGGTATTATCG i ATGGC 800 

810 820 830 840 




TTCCCTGGTGTACTATGGATTCCGCATGCGGCCTGATGAC 840 
ATTGTCTATGACTGCCTCCCCCTCTACCACTCAAGCAGGA 880 
AACATCGTGGGGATTGGC AGTGCTTACTCCACGGCATGAC 920 
TGTGGTGATCCGGAAGAAGTTCTCAGCCTCCCGGTTCTGG 960 
GATGATTGTATCAAGTAC AACTGCACAGTGGTACAGTACA 1000 

1010 1020 1030 1040 




TTGGCGAGCTCTGCCGCTACCTCCTGAACCAGCCACCCCG 1040 
TGAGGCTGAGTCTCGGCACAAGGTGCGCATGGCACTGGGC 1080 
AACGGTCTCCGGCAGTCCATCTGGACCGACTTCTCCAGCC 1120
GTTTCCACATCCCCCAGGTGGCTGAGTTC TATGGGGCCAC 1160
TGAATGCAACTGTAGCCTGGGC AACTTTGACAGCCGGGTG 1200 






82/117 

mmFATP4 full length.DNA

1210 1220 1230 1240 


GGGGCCTGTGGCTTCAATAGCCGCATCCTGTCCTTTGTGT 1240 
ACCCTATCCGTTTGGTACGTGTC AATGAGGATACCATGGA 1280 
ACTGATCCGGGGACCCGATGGAGTCTGCATTCCCTGTCAA 1320 
CCAGGTCAGCCAGGCCAGCTGGTGGGTCGCATCATCCAGC 1360 
AGGACCCTCTGCGCCGTTTCGACGGGTACCTCAACCAGGG 1400 

1410 1420 1430 1440

TGCCAACAACAAGAAGATTGCTAATGATGTCTTCAAGAAG 1440 
GGGGACCAAGCCTACCTCACTGGTGACGTCCTGGTGATGG 1480 
ATGAGCTGGGTTACCTGTACTTCCGAGATCGCACTGGGGA 1520 
CACGTTCCGCTGGAAAGGGGAGAATGTATCTACCACTGAG 1560 
GTGGAGGGCACACTCAGCCGCCTGCT TCATATGGCAGATG 1600

1610 1620 1630 1640 




TGGCAGTTTATGGTGTTGAGGTGCCAGGAACTGAAGGCCG 1640 
AGCAGGAATGGCTGCCGTTGCAAGTCCCATCAGCAACTGT 1680 
GACCTGGAGAGCTTTGCACAGACCTTGAAAAAGGAGCTGC 1720 
CTCTGTATGCCCGCCCCATC TTCCTGCGCTTCTTGCCTGA 1760 
GCTGCACAAGACAGGGACCTTCAAGTTCCAGAAGACAGAG 1800 

1810 1820 1830 1840

TTGCGGAAGGAGGGCTTTGACCCATCTGTTGTGAAAGACC 1840 
CGCTGTTCTATCTGGATGCTCGGAAGGGCTGCTACGT iGC 1880
ACTGGACCAGGAGGCCTATACCCGCATCCAGGCAGGCGAG 1920
GAGAAGCTGTGATTTCCCCCTACATCCCTC TGAGGGCCAG 1960 
AAGATGCTGGATTCAGAGCCCTAGCGTCC ACCCCAGAGGG 2000 

2010 2020 2030 2040

TCCTGGGCAATGCCAGACCAAAGCTAGCAGGGCCCGCACC 2040 
TCCGCCCCTAGGTGCTGATCTCCCCTCTCCCAAACTGCCA 2080 
AGTGAC TC AC TGCCGC TTCCCCGACCC TCCAGAGGCTT TC 2120
TGTGAAAGTCTCATCCAAGCTGTGTCTTCTGGTCCAGGCG 2160 
TGGCCCCTGGCCCCAGGGTTTCTGATAGGCTCCTTTAGGA 2200 

2210 2220 2230 2240 




TGGTATCTTGGGTCCAGCGGGCCAGGGTGTGGGAGAGGAG 2240 
TCACTAAGATCCCTCCAATCAGAAGGGAGCTTACAAAGGA 2280 
ACCAAGGCAAAGCCTGTAGACTCAGGAAGCTAAGTGGCCA 2320 
GAGACTATAGTGGCCAGTCATCCCATGTCCACAGAGGAK 2360 
TTGGTCCAGAGCTGCCAAAGTGTCACCTCTCCCTGCCTGC 2400 

2410 2420 2430 2440 


ACCTCTGGGGAAAAGAGGACAGCATGTGGCCACTGGGCAC 2440 
CTGTCTCAAGAAGTCAGGATCACACACTCAGTCCTTGTTT 2480 
CTCCAGGTTCCCTTGTTCTTGTCTCGGGGAGGGAGGGACG 2520 
AGTGTCCTGTCTGTCCTTCCTGCCTGTCTGTGAGTCTGTG 2560 
TTGCTTCTCCATCTGTCCTAGCCTGAGTGTGGGTGGAACA 2600 






83/117 

mmFATP4 full length.DNA

2610 2620 2630 2640 


GGCATGAGGAGAGTGTGGCTCAGGGGCCAATAAACTCTGC 2640 
CTTGACTCCTCTTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2680 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2710






84/117 

mmFATP4 full length.protein



10 20 30 40

RRDALRHCLDTSkIrAL I FGSEMASA I CE IHASLEPTLSL 200 

210 220 230 240



FCSGSWEPSTVPVSTEHLDPLLEDAPKHLPSHPOKGFTOK 240 
^yIyTSGTTGLPKAAIVVHSRYYRMASLVYYGFRMRPDO 280 
WYDCLPLYHSSRKHRGDWQCLLHGMTVV I RKKFSASRFW 320 
DOC^KYNCTVVOY IGELCRYLLNQPPREAESRHKVRMALG 360 

SglrSofssrfhipovaefygatecncslgnfdsrv 400 

410 420 430 440



GACGFNSR lsfv yp RL VRVNEDTMEL I RGPDGVC I PCQ 440 
prnPPQLVGR I 1 OQDPLRRFOGYLNQGANNKK I ANDVFKK 480 

gdSayltgovlvmoelgylyfrdrtgdtfrwkgenvstte 520 

vrrTLSRLLHhADVAVYGVEVPGTEGRAGMAAVASP I SNC 560 
SlESFAGTLKKELPLYARPIFLRFLPELHKTGTFKFQKTE 600 

610 620 630 640 


LRKEGFDPSVVKDPLFYLDARKGCY VALDQEAYTR I QAGE 640 

EKL. 644 

Fi 3 . Cf 



85/117 

mmFATP5 full length.DNA



10 20 30 40
CACTCATCAGAGCTAAGAGAGACTACACGCTCTCATCTAC 40 



TTCAGAAAGAGCCAATGCCATGGGTATTTGGAAGAAAC1 « - 
ACCTTACTGCTGTTGCTGCTTCTGCTGGTTGGCCTGGGGC 120
AGCCCCCATGGCCAGCAGCTATGGCTCTGGCCCTGCGTTG 160 
GTTCCTGGGAGACCCCACATGCCTTGTGCTGCTTGGCTTG 200 

210 220 230 240 


GCATTGCTGGGCAGACCCTGGATCAGCTCCTGGATGCCCC 240 
ACTGGCTGAGCCTGGTAGGAGCAGCTCTTACCTTATTCCT 280 
ATTGCCTCTACAGCCACCCCCAGGGCTACGCTGGCTGC AT 320 

aaIgatgtggct?tcaccttcaagatgcttttctatggcc 360 

TAAAGTTCAGGCGACGCCTTAACAAACATCCTCCAGAGAC 400 

410 420 430 440 


CTTTGTGGATGCTTTAGAGCGGCAAGC ACTGGCATGGCCT 440 
GACCGGGTGGCCTTGGTGTGTACTGGGTCTGAGGGCTCCT 480 
CAATCACAAATAGCCAGCTGGATGCCAGGTCCTGTCAGGC 520 
AGCATGGGTCCTGAAAGCAAAGCTGAAGGATGCCGTAATC 560 
CAGAACACAAGAGATGCTGCTGCTATCTTAGTTCrCCCGT 600 

610 620 630 640 



CCAAGACCATTTCTGCTTTGAGTGTGTTTCTGGGGTTGGC 640 
CAAGTTGGGCTGCCCTGTGGCCTGGATCAATCCACACAGC 680 
CGAGGGATGCCCTTGCTACACTCTGTACGGAGCTCTGGGG 720 
CCAGTGTGCTGATTGTGGATCCAGACCTCCAGGAGAACCT 760 
GGAAGAAGTCCTTCCCAAGCTGCTAGCTGAGAACATTCAC 800 

810 820 830 840 



TGCTTCTACCTTGGCCACAGCTCACCC ACCCCGGGAGTAG 840 
AGGCTCTGGGAGCTTCCCTGGATGCTGCACCTTCTGACCC 880 
AGTACCTGCCAGCCTTCGAGCTACGATTAAGTGGAAATCT 920 
CCTGCCATATTCATCTTTACTTCAGGGACCACTGGACTCC 960 
CAAAGCCAGCCATCTTATCACATGAGCGGGTCATACAAGT 1000 

1010 1020 1030 1040 






r Arr AACGTGCTGTCCTTCTGTGGATGCAGAGCTGATGAT 1040 
GTGGTCTATGACGTCC TACCTCTGTACC ATACGATAGGGC 1080 
TTGTCCTTGGATTCCTTGGCTGCTTAC AAGTTGGAGCCAC 1120
CTGTGTCCTGGCCCCC AAGTTCTCTGCCTCCCGATTCTGG 1160
GCTGAGTGCCGGCAGCATGGCGTAACAGTGATCTTGTATG 1200 






86/117 

mmFATP5 full length.DNA



1210 1220 1230 1240 




TGGGTGAAATCCTGCGGTACTTGTGTAACGTCCC » GAGCA 1240 

ACCAGAAGACAAGATACATACAGTGCGCTTGGCCATGGGA 1280 

ACTGGACTTCGGGCAAATGTGTGGAAAAACTTCCAGCAAC 1320 

GCTTTGGTCCCAT TCGGATCTGGGAATTCTACGGATCC AC 1360 

AGAGGGCAATGTGGGCTTAATGAACTATGTGGGCCACTGC 1400 

1410 1420 1430 1440

GGGGCTGTGGGAAGGACCAGCTGCATCC TTCGAATGCTGA 1440 

CTCCCTTTGAGCTTGTACAGTTCGACATAGAGACAGCAGA 1480 

GCCTCTGAGGGACAAACAGGGTTTTTGCATTCCTGTGGAG 1520 

CCAGGAAAGCCAGGACTTCTTTTGACCAAGGTTCGAAAGA 1560 

ACCAACCCTTCCTGGGCTACCGTGGTTCCCAGGCCGAGTC 1600 

1610 1620 1630 1640 


CAATCGGAAACTTGTTGCGAATGTACGACGCGTAGGAGAu 1640 

CTGTACTTCAACACTGGGGACGTGCTGACCTTGGACCAGG 1680 

AAGGCTTCTTCTACTTTC AAGACCGCCTTGGTGACACCTT 1720 

CCGGTGGAAGGGCGAAAACGTATCTACTGGAGAGGTGGAG 1760 

TGTGTTTTGTCTAGCCTAGACTTCCTAGAGGAAGTCAA.G 1800 

1810 1820 1830 1840 




TCTATGGTGTGCCTGTGCC AGGGTGTGAGGGTAAGGTTGG 1840 
CATGGCTGCTGTGAAACTGGCTCCTGGGAAGACTTTTGAT 1880 
GGGCAGAAGCTATACCAGCATGTCCGCTCCTGGCTCCCTG 1920
CCTATGCCACACCTCATTTCATCCGTATCCAGGATTCCC i 1960
GGAGATCACAAACACCTACAAGCTGGTAAAGTCACGGCTG 2000 

2010 2020 2030 2040



GTGCGTGAGGGTTTTGATGTGGGGATCATTGCTGACCCCC 2040 
TCTACATACTGGACAACAAGGCCCAGACCTTCCGGAGTCT 2080 
GATGCCAGATGTGTACCAGGCTGTGTGTGAAGGAACCTGG 2120
AATCTCTGACCACCTAGCCAACTGGAAGGCAATCCAAAAG 2160 
TGTAGAGATTGACAC TAGTCAGCTTCACAAAGTTGTCCGG 2200 



2210 2220 2230 2240 



GTTCCAGATGCCCATGGCCCAGTAGTACTTAGAGAATAAA 2240 
CTTGAATGTGTATAC AAAAAAAAAAAAAAAAAAAAAA 2277 



Pj. 70 & 






87/117 

mmFATP5 full length.protein



10 20 30 40

"mp i wil iC l in L lllULVGLGQPPWPAAMALALRWFLGOPT 40 
fL^^^OrTROAAlfuVUPSKT.SALSVrLGLAKLGCPV^OO 



210 220 230 240

' ' \ , ,^AiD.\ l u^vRqSGASVL VDPOLGENLEEVLPK 240 

f if IpN^HCFYLGH^ 280 
LLAEN i HCFYLGHSbh^ irb QVSNVLSF 320 

410 420 430 440

610 620 630 640



^' ^l^rA^ ' vAHVRSWLPA YATPHFlRIQDSLEtTNTY 640 
KLVKSRLVREGFDVG^ f ADPLY*LONKAOTFRSLMPOVYQ 680 
AVCEGTWNL. 690 

f,y 7/ 






88/117 

dmFATP partial.DNA



10 20 30 40 


GCTCTCTGGGCCTATATCAAGCTGCTGAGGTACACGAAGC 40 
GCCATGAGCGGCTCAACTACACGGTGGCGGACGTCTTCGA 80 
ACGAAATGTTCAGGCCCATCCGGACAAGGTGGCTGTGGTC 120
AGTGAGACGCAACGCTGGACCTTCCGTCAGGTGAACGAGC 160 
ATGCGAACAAGGTGGCCAATGTGCTGCAGGCTCAGGGCTA 200 

210 220 230 240

CAAAAAGGGCGATGTGGTGGCCCTGTTGCTGGAGAACCGC 240 
GCCGAGTACGTGGCCACCTGGCTGGGTCTCTCCAAGATCG 280 
GTGTGATCACACCGCTGATCAACACGAATCTGCGCGGTCC 320 
CTCCCTGCTGCACAGCATC ACGGTGGCCCATTGCTCGGCT 360 
CTCATTTACGGCGAGGACTTCCTGGAAGCTGTCACCGACG 400 

410 420 430 440



TGGCCAAGGATCTGCCAGCGAACCTCACACTCTTCCAGTT 440 
CAACAACGAGAACAACAACAGGGAGACGGAAAAGAACATA 480 
CCGCAGGCCAAGAATCTGAACGCGCTGCTGACCACGG CA 520 
GCTATGAGAAGCCTAACAAGACGCAGGTTAACCACCACGA 560 
CAAGCTGGTCTACATCTACACCTCCGGCACCACAGGAT'TG 600 

610 620 630 640 


CCAAAGGCTGCGGTTATCTCTCACTCCCGTTATCTGTTTA 640 
TCGCTGCTGGCATCCACTACACCATGGGTTTCCAGGAGGA 680 
GGACATCTTCTACACGCCCTTGCCTTTGTACCACACCGCT 720 
rGTGGCATTATGTGCATGGGTCAGTCGGTGCTCTTTGGCT 760 
CCACGGTCTCCATTCGCAAGAAGTTCTCGGCATCCAACTA 800 

810 820 830 840

TTTCGCCGACTGCGCC AAGTATAATGCAACTATTGGTCAG 840 
TATATCGGTGAGATGGCTCGC TACATTCTAGCTACGAAAC 880 
CCTCGGAATACGACCAGAAACACCGAGTGCGTCTGGTCTT 920 
TGGAAACGGACTGCGACCGCAGATTTGGCCACAGTTTGTG 960 
CAGCGCTTCAACATTGCCAAGGTTGGCGAGTTCTACGGCG 1000 

1010 1020 1030 1040 


CCACCGAGGGTAATGCGAACATCATGAATCATGACAACAC 1040 
GGTGGGCGCCATCGGCTTTGTGTCGCGCATCCTGCCCAAG 1080 
ATCTACCCAATCTCGATCATTCGCGCCGATCCGGACACCG 1120
gIgIgCCCATTAGAGATAGGAATGGCCTATGCCAACTGTG 1160 
CGCTCCCAACGAGCCAGGCGTATTCATCGGC AAGATCG i C 1200 

fia. -72 A 






89/117 

dmFATP partial.DNA

1210 1220 1230 1240 


AAAGGAAATCCTTC TCGCGAATTCCTCGGATACGTCGATG 1240 
AAAAGGCCTCCGCGAAGAAGATTGTTAAGGATGTGTTCAA 1280 
GCATGGCGATATGGCTTTCATCTCCGGAGATCTGCTGGTT 1320 
GCCGACGAGAAGGGTTATCTGTAC TTCAAGGATCGCACCG 1360 
GTGACACCTTCCGCTGGAAGGGCGAGAATGTTTCCACCAG 1400 

1410 1420 1430 1440 


CGAGGTGGAGGCGCAAGTCAGCAA TGTGGCCGGTTACAAG 1440 
GATACCGTCGTTTACGGCGTAACCATTCCGCACACCGAGG 1480 
GAAGGGCCGGCATGGCCGCCATC TATGATCCGGAGCGAGA 1520 
ATTGGACCTCGACGTCTTCGCCGCTAGCTTGGCCAAGGTG 1560 
CTGCCCGCGTACGC TCGTCCCC AG ATCATTCGATTGCTCA 1600 

1610 1620 1630 1640 


CCAAGGTGGACCTGACTGGAACCTTTAAGCTGCGCAAGGT 1640 
AGACCTGCAGAAGGAGGGCTACGATCCGAACGCGATCAAG 1680 
GACGCGCTGTACTACCAGACTTCCAAGGGTCGGTACGAGC 1720 
TGCTCACGCCCCAGGTTTACGACCAGGTGCAGCGCAACGA 1760 
AATCCGCTTCTAAGAGCTGCAATAGAGTTGTGTCTGAACC 1800 

1810 1820 1830 1840 


TTGCCTTTTGCCCAATATGCTGTTAATTAGTTTGTAAGGC 1840 
TAAGTGTAGTAGAGGAAAATCGGGGGAAATCGGCAGCAAA 1880 
GATCATTCAGCCTAGGAGAGATGCATCCGAAGCACATTTC 1920 
CATGTCAACAATGCACTTTTGTATATCGTAAGCATATATA 1960 
TATCGTATATCGTAAACGTAGTTGTATCTGCATTTGTGTA 2000 

2010 2020 2030 2040 


GATGATAGCCTCCTATACGCATTTCAATTGTTTTTAGCGT 2040 
GCTAAAGAACCTTGTTAAATGCAATTTCAGCTATTGTTTA 2080 
GTCAGTTTTAGTGGCATTTACACTTCCATTCTCGTTGCGT 2120 
TTCGTTTTTGCCTGT AC ATATGAGAAGC TCTGATGTTTTT 2160 
GTATCAAATAAAGTTTTTTCCTTCACCACGGACCACGTGA 2200 

2210 2220 2230 2240 


AAAAAAAAAAAAAAAAAAAAA 2221 







90/117 

dmFATP partial.protein



10 20 30 40 




ALWAY IKLLRYTKRHERLNYTVAOVFERNVQAHPOKVAVV 40
SETORWTFRQVNEHANK VANVLOADGYKKGO VVALLLENR 80 

AE Y VATWLGLSK I GV I TPLI NTNLRGPSLLHS I TV AHCSA 120
L I YGEDFLEAVTDVAKDLPANLTLFQFNNENNNSETEKN I 160
PQAKNLNALLTTASYEKPNKTQVNHHOKLVY I YTSGTTGL 200 

210 220 230 240 


PKAAV I SHSRYLF I A AG I HYTMGFQEEO I FYTPLPLYHTA 240
GG^MCMGQSVLFGSTVS I RKKFSASNYFADC AKYNAT I GQ 280 
Y 1 GEMARY ILATKPSEYDQKHRVRLVFGNGLRPQ 1WP0FV 320 
QRFN I AKVGEFYGATEGNAN I MNHONTVGA I GFVSR 1LPK 360 
UP I S I 1 RADPDTGEP I RORNGLCQLCAPNEPGVF I GK I V 400 
410 420 430 440 


KTNPSREFLGYVDEKASAKK I VKDVFKHGDMAF I SGOLLV 440 

adek!y[yfkdrtIotfrwkgenvstseveaqvsnvagyk 480 

DTVVYGVT IPHTEGRAGMAA I YDPERELDLDVFAASLAKV 520 
LPAYARPQI I RLLTK VDLTGTFKLRK VDLQKEGYDPNA I K 560 
DALY YQTSKGRYELLTPQV YDQVQRNE I RF 590 






91/117 

drFATP partial.DNA 



10 20 30 40 


AGTGTAGATACCACAGGAACGTTTAAAAT CCAGAAGACCA 40 
GACTGCAAAGGGAAGGATACGATCCACGGCTC ACAACTGA 80 
CCAGATCTACTTCCTAAACTCCAGAGCAGGGCGTTACGAG 120 
CTTGTCAACGAGGAGCTGTACAATGCATTTGAACAAGGGC 160 
AGGATTTCCCTTT 173 






92/117 

drFATP partial.protein 



10 20 30 40 


SVDTTG7FK IQKTRLQREGYOPRLTTDQ I YFLNSRAGRYE 40 
LVNEELYNAFEQGQDFP 57 



fia. 15 



93/117 

ceFATPa coding only.DNA 



10 20 30 40

ATGAAGCTGGAGGAGCTTGTGACAGTTATGCTTCTCACAG 40 
TGGCTGTCATTGCTCAGAATCTTCCGATTGGAGTAA7ATT 80 
GGCTGGAGTTCTTATTTTATACATCACAGTGGTTCATGGA 120 
GATTTCATTTATAGAAGTTATCTTACGTTGAATAGGGATT 160 
TAACAGGATTGGCTCTAATTATTGAAGTCAAAATCGACCT 200 

210 220 230 240 




ATGGTGGAGGTTGCATCAGAATAAAGGAATCCATGAACTG 240 
TTTTTGGATATTGTGAAAAAGAATCCAAATAAGCCGGCGA 280 
TGATTGACATCGAGACGAATACAACAGAAACATACGCAGA 320 
GTTCAATGCACATTGTAATAGATATGCCAATTATTTCCAG 360 
GGTCTTGGCTATCGATCCGGAGACGTTGTCGCCTTGTACA 400 



410 420 430 440



TGGAGAACTCGGTCGAGTTTGTGGCCGCGTGGATGGGACT 440 
CGCAAAAATCGGAGTTGTAACGGCTTGGATCAACTCGAAT 480 
TTGAAAAGAGAGCAACTTGTTCATTGTATCACTGCGAGCA 520 
AGACAAAGGCGATTATCACAAGTGTAACACTTCAGAATAT 560 
TATGCTTGATGCTATCGATC AGAAGCTGTTTGATGTTGAG 600 

610 620 630 640 




GGAATTGAGGTTTACTCTGTCGGAGAGCCCAAGAAGAATT 640 
CTGGATTCAAGAATCTC AAGAAGAAGTTGGATGCTCAAAT 680 
TACTACGGAACCAAAGACCCTTGACATAGTAGATTTTAAA 720 
AGTATTCTTTGCTTCATCTATACAAGTGGTACTACTGGAA 760 
TGCCAAAAGCCGCTGTCATGAAGCACTTCAGATATTACTC 800 

810 820 830 840 


GATTGCCGTTGGAGCCGCAAAATCATTCGGAATCCGCCCi 840 

TCTGATCGTATGTACGTCTCGATGCCAATTTATCAC ACTG 880 

CAGCTGGAATTCTTGGAGTTGGGCAAGCTCTGTTGGGTGG 920 

ATCATCGTGTGTCATTAGAAAAAAATTCTCGGCTAGCAAC 960 

TTTTGGAGGGATTGTGTAAAGTATGATTGTACAGTTTCAC 1000 

1010 1020 1030 1040 


AATACATTGGAGAGATTTGTCGGTACTTGTTGGCTCAGCC 1040 
AGTTGTGGAAGAGGAATCC AGGCATAGAATGAGATTGTTG 1080
GTTGGAAACGGACTCCGTGC TGAAATCTGGCAACCATTTG 1120
TAGATCGATTCCGTGTC AGAATTGGAGAACT7TATGGTTC 1160 
AACTGAAGGAACTTCATCTCTCGTGAACATTGACGGACA i 1200 






94/117 

ceFATPa coding only.DNA 



1210 1220 1230 1240 




GTCGGAGCTTGCGGATTCTTGCCAATATCCCCATTAACAA 1240 
AGAAAATGCATCCGGTTCGATTAATTAAGGTTGATGATGT 1280 
CACTGGAGAAGCAATCCGAACTTCCGATGGACTTTGCATT 1320 
GCATGTAATCCAGGAGAGTCTGGAGCAATGGTGTCGACGA 1360
TCAGAAAAAATAATCCATTATTGCAATTCGAGGGATATCT 1400 

1410 1420 1430 1440 


GAATAAGAAGGAAACGAATAAAAAGATTATCAGAGATGTC 1440 

TTCGCAAAGGGAGATAGTTGCTTTTTGACTGGAGATCTTC 1480
TTCATTGGGATCGTCTTGGTTATGTATATTTCAAGGATCG 1520
TACTGGAGATACTTTCCGTTGGA AGGGAGAGAATGTGTCG 1560
IcTACTGAAgVcGAGGCAATTCTTCATCCAATTACTGGAT 1600 

1610 1620 1630 1640 




TGTCTGATGCAACTGTTTATGGTGTAGAGGTTCCTCAAAG 1640 
AGAGGGAAGAGTTGGAATGGCGTCAGTTGTTCGAGTTGTA 1680
TCGCATGAGGAAGATGAAACTCAATTTGTTCATAGAGTTG 1720
GAGCAAGACTTGCCTCTTCGCTTACCAGCTACGCGATTCC 1760
TCAGTTTATGCGAATTTGTCAGGATGTTGAGAAAACAGGT 1800 

1810 1820 1830 1840 


ACATTCAAACTTGTGAAGACGAATCTACAACGATTAGGTA 1840 
TCATGGATGCTCCTTCAGATTCAATTTACATCTACAATTC 1880
TGAAAATCGCAATTTTGTGCCGTTCGACAATGATTTGAGG 1920
TGCAAGGTCTCACTGGGAAGTTATCCATTTTAA 1953



fia. 76 e> 






95/117 

ceFATPa coding only.protein



10 20 30 40 


MKLEELVTVMLLTVAV I AQNLP ! GV I LAGVL I LY I TVVHG 40 
OF I YRSYLTLNROLTGLAL I I EVKl DLWWRLHQNKG I HEL 80 
FLD I VKKNPNKPAM I 0 I ETNTTETYAEFNAHCNRYANYFQ 120 
GLGYRSGDV VALYMENSVEFVAAWMGLAK I GVVTAWI NSN 160 
LKREQLVHC ITASKTKAI I TSVTLQN I MLDA I DQKLFDVE 200 

210 220 230 240 


G I EVYSVGEPKKNSGFKNLKKKLDAQ I TTEPKTLD I VDFK 240 
S I LCF I YTSGTTGMPKAAVMKHFRYYS I AVGAAKSFG I RP 280 
SDRMYVSMP I YHTAAG ILGVGQALLGGSSCV I RKKFSASN 320 
FWRDCVKYDCTVSQY I GE I CRYLLAQPVVEEESRHRMRLL 360 
VGNGLRAE I WQPFVDRFR VR I GELYGSTEGTSSLVN I OGH 400 

410 420 430 440 


VGACGFLP I SPLTKKI1HPVRL I KVDDVTGEA I RTSDGLC I 440 
ACNPGESGAMVST I RKNNPLLQFEG YLNKKETNKK I I RDV 480 
FAKGDSCFLTGDLLHWDRLGY VYFKORTGOTFRWKGENVS 520 
TTEVEA I LHP 1 TGLSDATVYGVEVPQREGR VGMASVVRVV 560 
SHEEDETGFVHRVGARLASSLTSYA I PQFhR ICQOVEKTG 600 

610 620 630 640 


TFKLVKTNLQRLG I MDAPSOS I Y I YNSENRNFVPFDNDLR 640 
CKVSLGSYPF. 651 



96/117 

ceFATPb coding only.DNA



10 20 30 40 


ATGAGGGAAATGCCGGACAGTCCCAAGTTTGCGTTAGTCA 40 
CGTTTGTTGTGTATGCAGTGGTTTTGTACAATGTCAACAG 80 
CGTTTTCTGGAAATTTGTATTC ATCGGATATGTTGTATT I 20 
AGGCTGCTTCGCACTGATTTTGGAAGAAGAGCACTTGCCA 160 
CGTTACCTAGAGATTTTGCGGGACTGAAGCTCTTAATA , C 200 

210 220 230 240 



rGTTAAGTCGACAATTCGTGGCTTGTTCAAGAAAGATCGC 240 

ccII^cItgaaatctttttgaatcaggtgaaacagcatc 280 

CAAACAAAGTGGCGATTATTGAAATTGAAAGTGGTAGGCA 320 
GTTGACGTATCAAGAATTGAATGCGTTAGCTAATCAGTAT 360 
GCTAACCTTTACGTGAGTGAAGGTTACAAAATGGGCGACG 400 

410 420 430 440

TTGTCGCTTTGTTTATGGAAAATAGCATCGACTTCTTTGC 440 
II?"gGCTGGGACTTTCCAAGATTGGAGTCGTGTCGGCG 480 
TTCATCAACTCAAACTTGAAGTTGGAGCCATTGGCACATT 520 
C^TAATGTTTCGAAGTGCAAATCATGCATTACCAATAT 560 
CAATCTGTTGCCGATGTTCAAAGCCGCTCGTGAAAAGAAT 600 

610 620 630 640 


CTGATCAGTGACGAGATCCACGTGTTTCTGGCTGGAACTC 640 
AGGTTGATGGACGTCATAGAAGTCTTCAGCAAGATCTCCA 680 
TCTTTTCTCTGAGGATGAACCTCCAGTTATAGACGGACTC 720 
AATTTTAGAAGCGTTCTGTGTTATATTTACACTTCCGGTA 760 
CTACCGGAAATCCAAAGCCAGCCGTCATTAAACACTTCCG 800 

810 820 830 840 


TTACTTCTGGATTGCGATGGGAGCAGGAAAAGCATTTGGA 840 
ATTAATAAGTCAGACGTTGTGTACATTACGATGCCAATGT 880 
ATCACTCTGCCGCCGGTATCATGGGTATTGGATCATTAAT 920 
TGCATTCGGGTCGACCGCTGTTATTAGGAAAAAGTTTTCG 960 
GCAAGCAACTTCTGGAAAGATTGCGTCAAGTACAACGTCA 1000 

1010 1020 1030 1040 


CAGCGACACAGTACATTGGAGAAATCTGCAGGTATCTTCT 1040 
GGCAGCGAATCCATGTCCTGAAGAGAAACAACACAACGTG 1080 
CGATTGATGTGGGGAAATGGTTTGAGAGGACAAATTTGGA 1120
AAGAGTTTGTAGGAAGATTTGGAATTAAGAAAATTGGAGA 1160
GTTGTACGGC TCAAC AGAAGGAAACTCCAATATTGTTAAC 1200 



fiy -7 2 A 






97/117 

ceFATPb coding only.DNA 

1210 1220 1230 1240 


GTGGATAACCATGTTGGAGCTTGTGGATTCATGCCAATTT 1240 
ATCCCCATATTGGATCCCTCTACCCAGTTCGACTTATTAA 1280 
GGTTGATAGAGCCACTGGAGAGC TTGAACGTGATAAGAAC 1320 
GGACTCTGTGTGCCGTGTGTGCCTGGTGAAACTGGGGAAA 1360 
TGGTTGGCGTTATCAAGGAGAAAGATATTCTTCTAAAGTT 1400 

1410 1420 1430 1440 


CGAAGGATATGTCAGCGAAGGGGATACTGCAAAGAAAATC 1440 
TACAGAGATGTGTTCAAGCATGGAGATAAGGTGTTTGCAA 1480 
GTGGAGATATTCTTCATTGGGATGATCTTGGATACTTGTA 1520 
CTTTGTGGACCGTTGTGGAGACACTTTCCGTTGGAAAGGG 1560 
GAGAACGTGTCAACTACTGAAGTTGAGGGAATTCTTCAGC 1600 

1610 1620 1630 1640 


CTGTGATGGATGTGGAAGATGCAACTGTTTATGGAGTCAC 1640 
TGTCGGTAAAATGGAGGGGCGTGCCGGAATGGCTGGTATT 1680 
GTCGTCAAGGATGGAACGGATGTTGAGAAATTCATCGCCG 1720 
AT ATT AC T TC TCGAC TG AC CGAA A ATC TGGC GTC T T AC GC 1760 
AATCCCTGTTTTCATTCGGCTGTGCAAGGAAGTTGATCGA 1800 

1810 1820 1830 1840 


ACCGGAACCTTCAAACTCAAGAAGACTGATC TTCAAAAAC 1840 

AAGGTTACGACCTGGTTGCTTGTAAAGGAGACCCAATTTA 1880 

CTACTGGTCAGCTGCAGAAAAATCCTACAAACCACTGACT 1920 

GACAAAATGCAACAGGATATTGACACTGGTGTTTATGATC 1960 
GCATTTAA 1968 






98/117 



ceFATPb coding only.protein 



10 20 30 40



MREMPDSPKFALVTFVV YAV VL YNVNS VFWKFVF I GYVVF 40 
RLLRTDFGRRALATLPRDFAGLKLL I SVKST I RGLFKKDR 80 
P I HE I FLNQVKQHPNKVA I I E I ESGRQLT YQELNALANQY 120 
ANLYVSEGYKMGDVVALFMENS I DFFA I WLGLSK I GVVSA 160 
F I NSNLKLEPLAHS I NVSKCKSC I TN I NLLPMFKAAREKN 200 

210 220 230 240 


L ISDE IHVFLAGTQVDGRHRSLOQDLHLFSEDEPPV JDGL 240 
NFRSVLC Y I YTSGTTGNPKPAV I KHFRYFWI AliGAGKAFG 280 
I NKSDVVY I TMPM YHSAAG IMG I GSL I AFGSTAV I RKKFS 320 
ASNFWKDCVKYNVTATQY I GE I CRYLLAANPCPEEKQHNV 360 
RLMWGNGLRGQ I WKEFVGRFG IKK I GELYGSTEGNSN I VN 400 

410 420 430 440 


VDNHVGACGFMP I YPHI GSLYPVRL I KVDRATGELERDKN 440 
GLCVPCVPGETGEMVGV I KEKD I LLKFEGYVSEGDTAKK I 480 
YROVFKHGOK VFASGD I LHWDDLG YL YFVDRCGOTFRWKG 520 
ENVSTTEVEG I LQPVMDVEDATVYGVTVGKMEGRAGMAG I 560 
VVKOGTDVEKF I AO I TSRLTENLASYA I P VF I RLCKEVOR 600 

610 620 630 640

TGTFKLKKTDLOKQGYDLVACKGDP I Y YWSAAEKSYKPLT 640 
OKMQQD I DTGVYOR 1 . 656 







99/117 

chFATP coding only.DNA



10 20 30 40 


ATGGCGTGTATGC ATCAGGCTCAGCTATACAATGATCTAG 40 
AGGAATTGCTAACTGGTCC ATCAGTACCCATCGTTGCTGG 80 
AGCTGCTGGAGCTGCAGCTCTCACTGCCTACATTAACGCC 120
AAATACCACATAGCCCATGATCTCAAGACCCTCGGTGGTG 160 
GATTGACACAATCGTCCGAAGCGATTGATTTCATAAACCG 200 

210 220 230 240 


CCGCGTCGCACAAAAGCGCGTCC TCACGCACCACATCTTC 240 
CAGGAGCAGGTCCAAAAACAATCAAATCATCCCTTTCTTA 280 
TCTTTGAGGGCAAGAC ATGGTCTTACAAGGAGTTCTCTGA 320 
GGCATACACGAGGGTCGCGAACTGGCTGATTGATGAGCTG 360 
GACGTACAAGTAGGGGAGATGGTCGCAATTGATGGCGGAA 400 

410 420 430 440



ATAGTGCAGAGCACCTGATGCTTTGGCTTGCACTTGATGC 440 
AATCGGTGCGGCTACGAGTTTTTTGAACTGGAACCTGACA 480 
GGGGCAGGGTTAATTCATTGCATAAAGCTATGCGAATGTC 520 
GATTCGTTATCGCAGACATCGATATTAAAGCGAACATTGA 560 
ACCGTGCCGTGGCGAACTGGAGGAGACGGGCATCAACAT . 600 

610 620 630 640 



CACTACTATGACCCATCCTTGATCTCATCGCTACCGAATA 640 
ACACGCCAATTCCCGACAGCCGCACTGAGAACATTGAATT 680 
AGATTCAGTACGAGGACTGATATACACATCTGGAACCACT 720 
GGTCTACCTAAAGGCGTGTTTATAAGCACTGGCCGCGAGC 760 

??Igg5ctgactggtcgatttcaaagtatc 800 

810 820 830 840 


GCCCACGGATCGAATGTATACATGTATGCCGCTCTACCAT 840 
GCCGCTGCACACAGCCTCTGTACAGCATCAGTTATTCATG 880 
GTGGAGGTACCGTGGTATTGAGCAGGAAATTCTCACACAA 920 
GAAGTTCTGGCCTGAAGTTGTGGC TTCGGAAGCAAATATC 960 
ATTCAGTACGTTGGTGAATTAGGTCGATATCTCCTGAATG 1000 

1010 1020 1030 1040 


GTCCAAAGAGTCCTTACGACAGGGCCCATAAAGTCCAGAT 1040 
GGCGTGGGGCAATGGC ATGCGTCCAGACGTGTGGGAAGCG 1080 
TTTCGTGAACGCTTCAACATACCAATTATTCATGAGCTCT 1120
ATGCCGCAACCGATGGGCTCGGGTCAATGACCAATCGTAA 1160
CGCGGGCCCTTTTACAGCAAAC TGTATTGCGCTGCGAGGG 1200 

fly X)A 






100/117 

chFATP coding only.DNA 

1210 1220 1230 1240 


CTGATCTGGCACTGGAAATTTCGAAATCAGGAAGTGCTGG 1240 
TCAAGATGGATCTCGATACTGATGAGATCATGAGAGATCG 1280 
CAATGGGTTTGCGATACGATGCGCTGTCAATGAACCTGGA 1320 
CAGATGCTTTTTCGGCTGACACCCGAAACTCTGGCTGGTG 1360 
CACCAAGCTACTACAACAACGAAACGGCCACACAGAGCAG 1400 

1410 1420 1430 1440 


GCGGATTACAGATGTGTTTCAAAAGGGTGACCTGTGGTTC 1440 
AAGTCCGGTGACATGCTACGGCAAGACGCCGAAGGCCGCG 1480 
TCTACTTTGTCGATCGACTAGGCGATACGTTCCGCTGGAA 1520 
ATCCGAAAACGTTTCTACCAATGAAGTCGCGGACGTGATG 1560 
GGCACATTTCCTCAGATTGCTGAAACGAATGTATACGGTG 1600 

1610 1620 1630 1640 


TCCTTGTGCCGGGTAACGATGGTCGAGTGCGCAGCCTCAA 1640 
TTGTCATGGCAGACGGCGTGACAGAGTCGACATTCGCTTC 1680 
GCTGCCCTTGCAAAGCACGCCCGAGATCGGTTACCGGGTT 1720 
ATGCTGTACCACTGTTTCTGAGGGTAACTCCAGCACTTGA 1760 
ATATACGGGCACATTAAAGATTCAGAAAGGACGCCTCAAG 1800 

1810 1820 1830 1840 


CAGGAAGGTATAGACCCAGATAAGATTTCCGGCGAAGATA 1840 
AGTTATACTGGCTGCCGCCTGGTAGCGATATATATTTACC 1880 
ATTTGGAAAGATGGAGTGGCAGGGAATTGTAGATAAGCGT 1920 
ATACGGCTGTGA 1932 



p.. 2o5 






101/117 

chFATP coding only.protein 



10 20 30 40



MACMHQAQLYNDLEELLTGPSVP I VAGAAGAAALTAY INA 40
KYH I AHDLKTLGGGLTOSSEA I OF I NRRVAOKR VLTHH I F 80
QEQVQKOSNHPFLI FEGKTWSYKEFSEAYTRVANWL I DEL 120
DVQVGEMVA I DGGNSAEHLMLWLALOA I GAATSFLNWNLT 160 
GAGL I HC I KLCECRFV I AD I D I KAN I EPCRGELEETG tNl 200 
210 220 230 240 






HYYDPSF I SSLPNNTP I PDSRTEN I ELDSVRGL I YTSGTT 240
GLPKGVF I STGRELRTDWS I SKYLNLKPTDRMYTCMPLYH 280 
AAAHSLCTASV I HGGGTV VLSRKFSHKKFWPE V V ASEAN I 320 
I QYVGELGRYLLNGPKSPYDRAHK VQMAWGNGMRPDVWEA 360 
FRERFN IP I IHELYAATDGLGSMTNRNAGPFTANC I ALRG 400
410 420 430 440 




L 1 WHWKFRNQEVLVKMDLDTDE I MRDRNGFA I RCAVNEPG 440 
QMLFRLTPETLAGAPSYYNNETATQSRR I TOVFQKGDLWF 480 
KSGDMLRQDAEGRVYFVDRLGDTFRWKSENVSTNEVADVM 520
GTFPQ I AETNVYGVLVPGNOGRVRSLNCHGRRRORVD I RF 560 
A ALAKHARDRLPG YAVPLFLR VTPALEYTGTLK I OKGRLK 600 
610 620 630 640 




QEG IOPOK I SGEDKLYWLPPGSD I YLPFGKMEWQG I VDKR 640 
IRL 643
aspergillus partial.DNA



10 20 30 40 




CTTTACCATTCATCAGCTTCATTCTGCATTTTTAGCTTGA 40 
CGGCAGCCGGGTCTACGCTGATCATCGGCCGCAAGTTCTC 80 
CGCGAGAAACTTCATAAAGGAAGCGCGCGAGAACGACGCC 120 
ACGGTCATCCAGTACGTGGGTGAGACCTTGCGATATCTGC 160 
TCGCCACCCCCGGTGAAACCGATCCAGTTACTGGCGAAGA 200 



210 220 230 240 




CCTGGACAAAAAGCACAATATTCGAGCAGTATACGGCAAC 240 
GGGCTACGGCCGGATATCTGGAACCGCTTC AAGGAGCGCT 280 
TCAACGTGCCGACGGTTGCCGAATTTTATGCTGCAACCGA 320 
GAGCCCAGGCGGAACATGGAACTATTCAACAAATGACTTC 360 
ACTGCCGGAGCCATTGGGCACACTGGCGTGCTTAGTGGAT 400 

410 420 430 440 




GGCTTCTTGGACGCGGCCTTACTATTGTCGAGGTGGACCA 440 
GGAATCACAGGAACCATGGCGCGATCCCCAAACCGGGTTC 480 
TGCAAGCCGGTCCCGCGAGGCGAAGCAGGCGAGCTCCTGT 520 
ATGCCATTGATCCGGCCGACCCGGGCGAGACCTTCCAGGG 560 
CTACTACCGCAACTCCTTTAGAGCACACTGGCGGCCG 597
103/117

aspergillus partial.protein



10 20 30 40 




LYHSSASFC IFSLTAAGSTL I I GRKFSARNF IKEARENOA 40 
TV I QYVGETLRYLLATPGETOPVTGEDLOKKHN I RAVYGN 80 

g^rp^SnSfkerfnvptvaefyaatespggtwnystnof 120 

TAGA1 GHTGVLSGWLLGRGLT I VEVDOESQEPWRDPQTGF 160 
CKPVPRGEAGELLYAIOPADPGETFOGYYRNSFRAHWRP 199 



Fr<- *3
mgFATP partial.DNA



10 20 30 40
GCAAAGGCCGACGCGTGGCTGCGGACGGGTAACGTGATCA 40
GGGCGGACAACGAAGGGCGACTCTTCTTCCACGACCGGAT 80 
CGGAGACACGTTCCGATGGAAGGGAGAGACNGTCAGCACA 120
CAAGAGGTCAGTTTGGTGCTCGGACGACACGACTCAATCA 160 
AGGAGGCCAACGTGTACGGCGTGACGGTGCCGAACCACGA 200 

210 220 230 240 




CGGGCGGGCCGGCTGCGCTGCGCTCACGCTATCAGACGCT 240 
CTGGCGACTGAAAAGAAGCTGGGCGATGAGCTGCTAAAGG 280 
GATTGGCTACTCACTCGTCGACTTCGCTTCCCAAGTTTGC 320 
GGTGCCGCAGTTCCTACGGGTGGTGCGCGGCGAGATGCAG 360 
TCAACGGGCACCAACAAGCAACAGAAGCACGACCTGAGGG 400 

410 420 430 440

TGCAGGGTGTAGAGCCGGGCAAGGTGGGCGTAGACGAGGT 440
GTACTGGTTGCGGGGAGGGACATATGTACCATTCGGAACA 480 
GAGGATTGGGATGGGTTGAAGAAGGGTCTTGTGAAGTTGT 520 



GA 522
mgFATP partial.protein



10 20 30 40



AKADAWLRTGNV IRADNEGRLFFHDR I GDTFRWKGETVST 40 
QEVSLVLGRHDS IKEANVYGVTVPNHDGRAGCAALTLSOA 80 
LATEKKLGDELLKGLATHSSTSLPKFAVPQFLRVVRGEMQ 120
STGTNKQOKHDLRVQGVEPGKVGVDEVYWLRGGTYVPFGT 160 

EOWOGLKKGLVKL 173 



IS
scFATP coding only.DNA



10 20 30 40 



ATGTCTCCCATACAGGTTGTTGTCTTTGCCTTGTCAAGGA 40 
TTTTCCTGCTATTATTCAGACTTATCAAGCTAATTATAAC 80 
CCCTATCCAGAAATC ACTGGGTTATCTATTTGGTAATTAT 120 
TTTGATGAATTAGACCGTAAATATAGATAC AAGGAGGATT 160 
GGTATATTATTCCTTACTTTTTGAAAAGCGTGTTTTGTTA 200 

210 220 230 240 


TATCATTGATGTGAGAAGACATAGGTTTCAAAACTGGTAC 240 
TTATTTATTAAACAGGTCCAACAAAATGGTGACC ATTTAG 280 
CGATTAGTTACACCCGTCCCATGGCCGAAAAGGGAGAATT 320 
TCAACTCGAAACCTTTACGTATATTGAAACTTATAACATA 360 
GTGTTGAGATTGTCTCATATTTTGCATTTTGATTATAACG 400 

410 420 430 440 




TTCAGGCCGGTGACTACGTGGCAATCGATTGTACTAATAA 440 
ACCTCTTTTCGTATTTTTATGGCTTTCTTTGTGGAACATT 480 
GGGGCTATTCCAGCTTTTTTAAACTATAATACTAAAGGCA 520 
CTCCGCTGGTTCACTCCCTAAAGATTTCCAATATTACGCA 560 
GGTATTTATTGACCCTGATGCCAGTAATCCGATCAGAGAA 600 

610 620 630 640

TCGGAAGAAGAAATCAAAAACGCACTTCC TGATGTTAAAT 640 
TAAACTATCTTGAAGAACAAGACTTAATGCATGAACTTTT 680 
AAATTCGCAATCACCGGAATTCTTACAACAAGACAACGTT 720 
AGGACACCACTAGGCTTGACCGATTTTAAACCCTCTATGT 760 
TAATTTATACATCTGGAACCACTGGTTTGCCTAAATCCGC 800 

810 820 830 840 


TATTATGTCTTGGAGAAAATCCTCCGTAGGTTGTCAAGTT 840 
TTTGGTCATGTTTTACATATGAC TAATGAAAGCACTGTGT 880 
TCACAGCCATGCCATTGTTCCATTCAACTGCTGCCTTATT 920 
AGGTGCGTGCGCCATTCTATCTCACGGTGGTTGCCTTGCG 960 
TTATCGCATAAATTTTCTGCCAGTACATTTTGGAAGCAAG 1000 

1010 1020 1030 1040 






TTTATTTAACAGGAGCCACGCACATCCAATATGTCGGAGA 1040 
AGTCTGTAGATACCTGTTACATACGCC AATTTCTAAGTAT 1080 
GAAAAGATGCATAAGGTGAAGGT TGCTTATGGTAACGGGC 1120
TGAGACCTGACATCTGGC AGGACTTCAGGAAGAGGTTCAA 1160
CATAGAAGTTATTGGTGAAT TCTATGCCGCAACTGAAGCT 1200
scFATP coding only.DNA

1210 1220 1230 1240
CCTTTTGCTACAACTACCTTCCAGAAAGGTGACTTTGGAA 1240
TTGGCGCATGTAGGAACTATGGTACTATAATTCAATGGTT 1280 
TTTGTCATTCCAACAAACATTGGTAAGGATGGACCCAAAT 1320 
GACGATTCCGTTATATATAGAAATTCCAAGGGTTTCTGCG 1360 
AAGTGGCCCCTGTTGGCGAACCAGGAGAAATGTTAATGAG 1400 



1410 1420 1430 1440 

AATCTTTTTCCCTAAAAAACCAGAAACATCTTTTCAAGGT 1440 
TATCTTGGTAATGCCAAGGAAACAAAGTCCAAAGTTGTGA 1480 
GGGATGTCTTCAGACGTGGCGATGCTTGGTATAGATGTGG 1520 
AGATTTATTAAAAGCGGACGAATATGGATTATGGTATTTC 1560 
CTTGATAGAATGGGTGATACTTTCAGATGGAAATCTGAAA 1600 

1610 1620 1630 1640 




ATGTTTCCACTACTGAAGTAGAAGATCAGTTGACGGCCAG 1640 
TAACAAAGAACAATATGCACAAGTTCTAGTTGTTGGTATT 1680 
AAAGTACCTAAATATGAAGGTAGAGCTGGTTTTGCAGTTA 1720 
TTAAACTAACTGACAACTCTCTTGACATCACTGCAAAGAC 1760 
CAAATTATTAAATGATTCCTTGAGCCGGTTAAATCTACCG 1800 

1810 1820 1830 1840 



TCTTATGCTATGCCCCTATTTGTTAAATTTGTTGATGAAA 1840 
TTAAAATGACAGATAACCTCATAAAATTTTGA 1872 




scFATP coding only.protein 




SYAMPLFVKFVDEIKMTDNLIKF. 624 

FIG. 57 



mtFATP coding only.DNA 



10 20 30 40 

GTGTCCGATTACTACGGCGGCGCACACACAACGGTCAGGC 40 
TGATCGACCTGGCAACTCGGATGCCGCGAGTGTTGGCGGA 80 
CACGCCGGTGATTGTGCGTGGGGCAATGACCGGGCTGCTG 120 
GCCCGGCCGAATTCCAAGGCGTCGATCGGCACGGTGTTCC 160 
AGGACCGGGCCGCTCGCTACGGTGACCGAGTCTTCCTGAA 200 

210 220 230 240 


ATTCGGCGATCAGCAGCTGACCTACCGCGACGCTAACGCC 240 
ACCGCCAACCGGTACGCCGCGGTGTTGGCCGCCCGCGGCG 280 
TCGGCCCCGGCGACGTCGTTGGCATCATGTTGCGTAACTC 320 
ACCCAGCACAGTCTTGGCGATGCTGGCCACGGTCAAGTGC 360 
GGCGCTATCGCCGGCATGCTCAACTACCACCAGCGCGGCG 400 

410 420 430 440 


AGGTGTTGGCGCACAGCCTGGGTCTGCTGGACGCGAAGGT 440 
ACTGATCGCAGAGTCCGACTTGGTCAGCGCCGTCGCCGAA 480 
TGCGGCGCCTCGCGCGGCCGGGTAGCGGGCGACGTGCTGA 520 
CCGTCGAGGACGTGGAGCGATTCGCCACAACGGCGCCCGC 560 
CACCAACCCGGCGTCGGCGTCGGCGGTGCAAGCCAAAGAC 600 

610 620 630 640 


ACCGCGTTCTACATCTTCACCTCGGGCACCACCGGATTTC 640 
CCAAGGCCAGTGTCATGACGCATCATCGGTGGCTGCGGGC 680 
GCTGGCCGTCTTCGGAGGGATGGGGCTGCGGCTGAAGGGT 720 
TCCGACACGCTCTACAGCTGCCTGCCGCTGTACCACAACA 760 
ACGCGTTAACGGTCGCGGTGTCGTCGGTGATCAATTCTGG 800 

810 820 830 840 


GGCGACCCTGGCGCTGGGTAAGTCGTTTTCGGCGTCGCGG 840 
TTCTGGGATGAGGTGATTGCCAACCGGGCGACGGCGTTCG 880 
TCTACATCGGCGAAATCTGCCGTTATCTGCTCAACCAGCC 920 
GGCCAAGCCGACCGACCGTGCCCACCAGGTGCGGGTGATC 960 
TGCGGTAACGGGCTGCGGCCGGAGATCTGGGATGAGTTCA 1000 

1010 1020 1030 1040 


CCACCCGCTTCGGGGTCGCGCGGGTGTGCGAGTTCTACGC 1040 
CGCCAGCGAAGGCAACTCGGCCTTTATCAACATCTTCAAC 1080 
GTGCCCAGGACCGCCGGGGTATCGCCGATGCCGCTTGCCT 1120 
TTGTGGAATACGACCTGGACACCGGCGATCCGCTGCGGGA 1160 
TGCGAGCGGGCGAGTGCGTCGGGTACCCGACGGTGAACCC 1200 



1210 1220 1230 1240 

GGCCTGTTGCTTAGCCGGGTCAACCGGCTGCAGCCGTTCG 1240 
ACGGCTACACCGACCCGGTTGCCAGCGAAAAGAAGTTGGT 1280 
GCGCAACGCTTTTCGAGATGGCGACTGTTGGTTCAACACC 1320 
GGTGACGTGATGAGCCCGCAGGGCATGGGCCATGCCGCCT 1360 
TCGTCGATCGGCTGGGCGACACCTTCCGCTGGAAGGGCGA 1400 

1410 1420 1430 1440 




GAATGTCGCCACCACTCAGGTCGAAGCGGCACTGGCCTCC 1440 
GACCAGACCGTCGAGGAGTGCACGGTCTACGGCGTCCAGA 1480 
TTCCGCGCACCGGCGGGCGCGCCGGAATGGCCGCGATCAC 1520 
ACTGCGCGCTGGCGCCGAATTCGACGGCCAGGCGCTGGCC 1560 
CGAACGGTTTACGGTCACTTGCCCGGCTATGCACTTCCGC 1600 

1610 1620 1630 1640 


TCTTTGTTCGGGTAGTGGGGTCGCTGGCGCACACCACGAC 1640 
GTTCAAGAGTCGCAAGGTGGAGTTGCGCAACCAGGCCTAT 1680 
GGCGCCGACATCGAGGATCCGCTGTACGTACTGGCCGGCC 1720 
CGGACGAAGGATATGTGCCGTACTACGCCGAATACCCTGA 1760 
GGAGGTTTCGCTCGGAAGGCGACCGCAGGGCTAG 1794 

R3. m 

mtFATP coding only.protein 



hsFATP3 

1 cga ccc acg cgt ccg ggg acg ccc gcg age ggc egg aac cag acg gcg ccg ata gag gaa 
1 MFASGWNQTVPIEE 

61 gcg ggc ccc acg gec gec etc ctg ctg ctg ccc ccg ccg ccg ccg cca ccg ccg ccg ccg 

15 AGSMAALLLLPLLLLLPLLL 

121 ccg ccg aag cca cac ccc egg ccg cag ccg cgc egg ccc ccg gcg gac ctg gec ttt gcg 

35 LLKLKLWPQLRWLPADLAFA 

181 gcg cga get ccg cgc tgc aaa agg get ccc cga gec cgc gec ccg gec gcg get gec gec 

55 VRALCCKRALRARALAAAAA 

241 gac ccg gaa ggt ccc gag ggg ggc tgc age ctg gee tgg cgc etc gcg gaa ctg gee cag 

75 DPEGPEGGCSLAWRLAELAQ 

301 cag cgc gec gcg cac ace ttt etc att cac ggc teg egg cgc cct age tac tea gag gcg 

95 QRAAHTFLIHGSRRFSYSEA 

361 gag cgc gag age aac agg gec gca cgc gec ccc cca cgc gcg cca ggc egg gac egg gga 

115 ERESNRAARAFLRALCWDWG 

421 ccc gac ggc ggc gac age ggc gag ggg age gee gga gaa ggc gag egg gca gcg ccg gga 

135 PDGGDSGEGSAGEGERAAPG 

481 gec gga gac gca gcg gee gga age ggc gcg gag ccc gee gga ggg gac ggc gec gee aga 

155 AGDAAAGSGAEFAGGDGAAR 

541 ggc gga gga gag ccc gee gee ccc ccg tea ccc gga gca acc gcg gcg ccg ccc etc ccc 

175 GGGEPAAPLSPGATVALLLP 

601 gee ggc cca gag ccc ccg tgg etc tgg ttc ggg ccg gee aag gee ggc ccg cgc act gee 

195 AGPEFLWLWFGLAKAGLRTA 

661 cct gtg ccc acc gee ctg cgc egg ggc ccc ccg ctg cac tgc ccc cgc age cgc ggc gcg 

215 FVPTALRRGPLLHCLRSCGA 

721 cgc gcg ccg gcg ccg gcg cca gag ccc ccg gag ccc ccg gag ccg gac ccg ccc gee ccg 

235 RALVLAPEFLESLEPDLPAL 

781 aga gee acg ggg ccc cac ccg egg get gca ggc cca gga acc cac cct gec gga acc age 

255 RAMGLHLWAAGPGTHPAGIS 

841 gac ccg ccg gee gaa gcg ccc gec gaa gcg gac ggg cca gcg cca gga cac ccc tec ccc 

275 DLLAEVSAEVDGPVPCYLSS 

901 ccc cag age aca aca gac acg cgc ccg cac acc ccc acc ccc ggc acc acg ggc ccc ccc 

295 PQSITDTCLYIFTSGTTGLP 

961 aag gec gec egg acc age cac ccg aag acc ccg caa cgc cag ggc tec tat cag ccg tgt 

315 KAARISHLKILQCQGFYQLC 

1021 ggt gtc cac cag gaa gat gcg acc cac ccc gee ccc cca ccc cac cac atg ccc ggc ccc 

335 GVHQEDVIYLALPLYHMSGS 

1081 ccg ccg ggc acc gcg ggc cgc acg ggc ace ggg gec aca gcg gcg ccg aaa tec aag ttc 

355 LLGIVGCMCIGATVVLKSKF 

1141 teg get ggt cag ttc egg gaa gac cgc cag cag cac agg gcg acg gcg ccc cag cac acc 
375 SAGQFWEDCQQHRVTVFQY I 

1201 ggg gag ctg tgc cga tac etc gee aac cag ccc ccg age aag gca gaa cgc ggc cac aag 
395 GELCRYLVNQPPSKAERGHK 

1261 gee egg ccg gca gcg ggc age ggg ccg cgc cca gac ace egg gag cgc ccc gcg egg cgc 

415 VRLAVGSGLRPDTWERFVRR 

1321 etc ggg ccc ctg cag gcg ccg gag aca tac gga ccg aca gag ggc aac gcg gec acc acc 

435 FGPLQVLETYGLTEGNVATI 

1381 aac cac aca gga cag egg ggc gec gcg ggg cgc gee ccc egg ccc cac aag cac ate ccc 

455 NYTGQRGAVGRASWLYKHIF 

1441 ccc ccc ccc ccg acc cgc tat gac gec acc aca gga gag cca acc egg gac ccc cag ggg 

475 PPSLIRYDVTTGEPIRDPQG 

1501 cac cgc aeg gee aca ece cca gge gag cca ggg ccg ctg gtg gec ccg gea age cag cag 

495 HCMATSPGEPGLLVAPVSQQ 

1561 ecc cca etc ccg ggc cac gee ggc ggg cca gag ccg gec cag ggg aag ccg cea aag gac 

515 SPFLGYAGGPELAQGKLLKD 

1621 gee CCc egg ccc ggg gac gec ccc ccc aac acc ggg gac ccg ccg gee cgc gae gac caa 

535 VFRPGDVFFNTGDLLVCDDQ 

1681 ggc tec ccc cgc ccc cac gac cgc ace gga gac acc cec agg egg aag ggg gag aac gcg 

555 GFLRFHDRTGDTFRWKGENV 

1741 gee aca acc gag gcg gca gag gtc ttc gag gec eta gat etc ccc cag gag gcg aac gec 

575 ATTEVAEVFEALDFLQEVNV 

1801 tac gga gec acc gcg cca ggg cac gaa ggc agg gec gga acg gca gee cea gec ccg cgc 

595 VGVTVPGHEGRAGMAALVLR 

1861 ccc ccc cac gee eeg gac ccc acg cag cec cac acc cac gcg ccc gag aac teg cca ece 
615 PPHALDLMQLYTHVSENLPP 

1921 cac gec egg ccc cga cec ccc agg cec cag gag ece ccg gec acc aca gag acc cec aaa 
635 YARPRFLRLQESLATTETFK 

1981 cag cag aaa gec egg aeg gca aac gag ggc cec gac ccc age acc ceg ccc gae cca ctg 
655 QQKVRMANEGFDPSTLSDPL 

2041 cac gee ctg gac cag get gta ggt gec cac ccg ccc ccc aca acc gec egg cac age gee 
675 YVLDQAVGAYLPLTTARYSA 

2101 cec ccg gca gga aac cec cga acc cga gaa cec cca cac ccg agg cac ccg aga gag gaa 
695 LLAGNLRI 

2161 ccc cgc 




